HOMOLOGOUSLY RECOMBINANT SLOW GROWING
MYCOBACTERIA AND USES THEREFOR

Description

Background of the Invention
The World Health Organization estimates that one in
three human beings is infected with Mycobacterium
tuberculosis (Styblo, K., Reviews of Infectious Diseases,
vol. 11, Suppl. 2, March-April, 1989; Bloom and Murray,
Science 257:1055-1067, 1992). Over the past decade, there
has been a resurgence in the incidence of tuberculosis in
developed countries that has coincided with the AIDS
epidemic (Snider and Roper, N. England J. Med. 326:703-705
(1992)). Because of their impact as major human pathogens
and as a result of their profound immunostimulatory
properties, mycobacteria have long been intensively
studied. In the early 1900s, an attenuated mycobacterium,
Mycobacterium (M.) bovis Bacille Calmette-Guerin (M. bovis
BCG or BCG), was isolated for use as a vaccine against
tuberculosis (Calmette et al., Acad. Natl. Med. (Paris)
91:787-796, 1924; reviewed in Collins, F.M., Bacterial
Vaccines (R. Germanier, ed.), Academic Press, pp. 373-418,
1984). Although the efficacy of this vaccine against
tuberculosis varied considerably in different trials, and
the reasons for its variable efficacy have yet to be
resolved, BCG is among the most widely used human vaccines
(Luelmo, F., Am. Rev. Respir. Dis. 125:70-72, 1982; Fine,
P.E.M., Reviews of Infectious Diseases 11 (suppl. 2):
S353-S359, 1989).

The recent application of molecular biological
technology to the study of mycobacteria has led to the
identification of many of the major antigens that are
targets of the immune response to infection by mycobacteria
(Kaufmann, S.H.E., Immunol. Today 11:129-136, 1990; Young,
Ann. Rev. Immunol. 8:401-420, 1990; Young et al., Academic
Press Ltd., London, pp. 1-35, 1990; Young et al., Mol.
Microbiol. 6:133-145, 1992) and to an improved
understanding of the molecular mechanisms involved in
resistance to antimycobacterial antibiotics (Zhang et al.,
Nature 358:591-593, 1992; Telenti et al., Lancet
341:647-650, 1993). The development of tools that permit
molecular genetic manipulation of mycobacteria has also
allowed the construction of recombinant BCG vaccine
vehicles (Snapper et al., Proc. Natl. Acad. Sci. USA
85:6987-6991, 1988; Husson et al., J. Bacteriol.
172:519-524, 1990; Martin et al., Nature 345:739-743, 1990;
Snapper et al., Mol. Microbiol. 4:1911-1919, 1990; Aldovini
and Young, Nature 351:479-482, 1991; Jacobs et al., Methods
Enzymol. 204:537-555, 1991; Lee et al., Proc. Natl. Acad.
Sci. USA 88:3111-3115, 1991; Stover et al., Nature
351:456-460, 1991; Winter et al., Gene 109:47-54, 1991;
Donnelly-Wu et al., Mol. Microbiol. 7:407-417, 1993).
Genome mapping and sequencing projects are providing
valuable information about the M. tuberculosis and M.
leprae genomes that will facilitate further study of the
biology of these pathogens (Eiglmeier et al., Mol.
Microbiol., in press, 1993; Young and Cole, J. Bacteriol.
175:1-6, 1993).

Despite these advances, there are two serious
limitations to our ability to manipulate these organisms
genetically. First, very few mycobacterial genes that can
be used as genetic markers have been isolated (Donnelly-Wu
et al., Mol. Microbiol. 7:407-417, 1993). In addition,
investigators have failed to obtain homologous
recombination in slow growing mycobacteria, such as M.
tuberculosis and M. bovis BCG (Kalpana et al., Proc. Natl.
Acad. Sci. USA 88:5433-5447, 1991; Young and Cole, J.
Bacteriol. 175:1-6, 1993), although homologous
recombination has been accomplished in the fast growing
Mycobacterium smegmatis (Husson et al., J. Bacteriol.
172:519-524, 1990).

Summary of the Invention

Described herein is a method of transforming
slow-growing mycobacteria, such as M. bovis BCG, M. leprae,
M. tuberculosis, M. avium, M. intracellulare and M.
africanum; a method of manipulating genomic DNA of
slow-growing mycobacteria through homologous recombination;
a method of producing homologously recombinant (HR)
slow-growing mycobacteria in which heterologous DNA is
integrated into the genomic DNA at a homologous locus;
homologously recombinant (HR) slow-growing mycobacteria
having heterologous DNA integrated into their genomic DNA
at a homologous locus; and mycobacterial DNA useful as a
genetic marker.

Applicants have succeeded in introducing heterologous
DNA into (i.e., transforming) slow-growing mycobacteria
through the use of electroporation in water (rather than
in buffer). In the present method of transforming
slow-growing mycobacteria, heterologous DNA (such as linear
DNA or plasmid DNA) and slow-growing mycobacteria (e.g., M.
bovis BCG, M. leprae, M. tuberculosis, M. avium, M.
intracellulare and M. africanum) are combined and the
resulting combination is subjected to electroporation at
an appropriate potential and capacitance for sufficient
time for the heterologous DNA to enter the slow-growing
mycobacteria, resulting in the production of transformed
mycobacteria containing the heterologous DNA. In one
embodiment, heterologous DNA and M. bovis BCG are combined
and subjected to electroporation in water. In a particular
embodiment, the M. bovis BCG-heterologous DNA combination
is subjected to electroporation in water at settings of
approximately 2.5 kV potential and approximately 25 µF
capacitance. Optionally, prior to harvest, cells to be
transformed are exposed to glycine (such as by adding 1-2%
glycine to culture medium in which the slow-growing
mycobacteria are growing) in order to enhance or improve
transformation efficiencies. In one embodiment, 1.5%
glycine is added to the culture medium 24 hours prior to
harvesting of the cells, which are then combined with
heterologous DNA to be introduced into the slow-growing
mycobacteria. The resulting combination is subjected to
electroporation, preferably in water, as described above.

In a further embodiment of the method of transforming
slow growing mycobacteria, cultures of the cells are
maintained in (continuously propagated in) mid-log growth,
in order to increase the fraction of cells which are
undergoing DNA synthesis (and which, thus, are competent
to take up heterologous DNA). Cultures of cells maintained
in log-phase growth are subjected to electroporation,
preferably in water and, as a result, are transformed with
the heterologous DNA. As described above, efficiency of
transformation can be increased by exposing the
slow-growing mycobacteria to glycine prior to
electroporation. Thus, in this embodiment, slow-growing
mycobacteria in log-phase growth are combined with
heterologous DNA (e.g., plasmid DNA, linearized DNA) to be
introduced into the slow-growing mycobacteria. The
resulting combination is subjected to electroporation
(preferably in water), under conditions (potential and
capacitance settings and sufficient time) appropriate for
transformation of the cells. Optionally, prior to
electroporation, the log-phase cells are exposed to glycine
(e.g., approximately 1-2% glycine added to culture medium)
in order to enhance transformation efficiency.

Heterologous DNA introduced into slow-growing
mycobacteria is DNA from any source other than the
recipient mycobacterium. It can be homologous to DNA
present in the recipient mycobacterial genomic DNA,
nonhomologous or both. DNA which is homologous to
mycobacterial genomic DNA is introduced into the genomic
DNA by homologous recombination or integration.
Alternatively, the heterologous DNA introduced by the
present method can be nonhomologous and, thus, enter
mycobacterial genomic DNA by random integration events or
remain extrachromosomal (unintegrated) after it enters the
mycobacterium. In addition, in one embodiment of the
present method, nonhomologous DNA linked to or inserted
within DNA homologous to genomic DNA of the recipient
mycobacterium is introduced into genomic DNA of the
recipient mycobacterium as a result of homologous
recombination which occurs between genomic DNA and the
homologous DNA to which the nonhomologous DNA is linked (or
in which it is inserted). For example, as described herein,
a mycobacterial gene which encodes a genetic marker has
been identified and isolated and used to target homologous
integration of heterologous DNA (DNA homologous to genomic
DNA of the mycobacterial recipient, alone or in conjunction
with DNA not homologous to genomic DNA of recipient
mycobacteria) into genomic DNA of a slow-growing
mycobacterium. Specifically, the M. bovis BCG gene encoding
orotidine-5'-monophosphate decarboxylase (OMP DCase) (uraA)
has been isolated, as has DNA flanking OMP DCase. The OMP
DCase gene and the flanking DNA have been sequenced. The
mycobacterial DNA containing the uraA locus, modified to
contain heterologous DNA (a selectable marker gene), has
been used to carry out integration of the heterologous DNA
(the mycobacterial DNA and the selectable marker gene) into
mycobacterial genomic DNA, resulting in production of
homologously recombinant mycobacteria containing the
heterologous DNA at a homologous locus. Specifically, M.
bovis BCG DNA containing the uraA locus and flanking
sequences was modified to replace the OMP DCase coding
sequence with the Kan^r selectable marker gene (aph). The
resulting construct, which included approximately 1.5 kb
uraA flanking sequences on each side of the selectable
marker gene, was transformed into M. bovis BCG, using the
method described above. M. bovis BCG cultures in mid-log
growth were subjected to electroporation in water,
resulting in transformation of cells with the construct.
Transformants were selected for further study, which showed
that all transformants assessed contained vector DNA
integrated into the genome and that in some of the
transformants, the transforming DNA had integrated at the
homologous genomic locus. Thus, heterologous DNA of
interest has been introduced into genomic DNA of
slow-growing mycobacteria through homologous recombination,
to produce homologously recombinant slow-growing
mycobacteria in which the heterologous DNA is integrated
into the homologous genomic locus (a genomic locus
homologous to at least a portion of the heterologous DNA).

Heterologous DNA which includes DNA homologous to
genomic DNA of the recipient mycobacterium (homologous
DNA) and DNA which is not homologous to genomic DNA of the
recipient mycobacterium (nonhomologous DNA) can be
introduced into (transformed into) slow growing
mycobacteria by the present method for several purposes. As
described herein, heterologous nonhomologous DNA encoding a
product to be expressed by the resulting homologously
recombinant slow-growing mycobacterium has been introduced
into mycobacterial genomic DNA at a locus homologous with
additional sequences to which the nonhomologous DNA is
linked. In this embodiment, the DNA construct transformed
into recipient slow-growing mycobacteria comprises
homologous DNA, which directs or targets introduction of
the heterologous DNA into the homologous locus of the
mycobacterial genome, and nonhomologous DNA, which is
expressed in transformed homologously recombinant
mycobacteria. In this embodiment, the nonhomologous DNA is
introduced into mycobacterial genomic DNA in such a manner
that it is added to the genomic DNA or replaces genomic
DNA. In a second embodiment, heterologous DNA integrated
into genomic DNA is not expressed in the recipient cells.
In this embodiment, the DNA construct includes homologous
DNA for targeting into a homologous genomic locus and DNA
which acts to knock out (inactivate) or activate a resident
mycobacterial gene. In the case of inactivation, the
mycobacterial gene is "knocked out", in the sense that it
is rendered inactive by addition of DNA whose presence
interferes with its ability to function, by removal or
replacement of sequences necessary for it to be functional
or by its complete removal from the mycobacterial genome.
In the case of activation, the heterologous DNA integrated
into the genomic DNA turns on or enhances expression of a
mycobacterial gene, such as by introducing a heterologous
promoter which controls the mycobacterial gene expression.
In the embodiment in which heterologous DNA affects
expression of an endogenous mycobacterial gene, the
homologous DNA can serve both functions (i.e., the
targeting and inactivating/activating functions); if that
is the case, the DNA construct includes only homologous
DNA. Alternatively, the DNA construct can include
homologous DNA (for targeting purposes) and nonhomologous
DNA (for altering function of the mycobacterial gene).

Homologously recombinant slow-growing mycobacteria of
the present invention are useful, for example, as vehicles
in which proteins encoded by the heterologous nonhomologous
DNA are expressed. They are useful as vaccines, which
express a polypeptide or a protein of interest (or more
than one polypeptide or protein), such as an antigen or
antigens of one or more pathogens against which protection
is desired (e.g., to prevent or treat a disease or
condition caused by the pathogen). Pathogens of interest
include viruses, retroviruses, bacteria, mycobacteria,
other microorganisms, organisms or substances (e.g.,
toxins or toxoids) which cause a disease or condition to
be prevented, treated or reversed. The homologously
recombinant slow-growing mycobacteria can also be used to
express enzymes, immunopotentiators, lymphokines,
pharmacologic agents, antitumor agents (e.g., cytokines),
or stress proteins (useful for evoking or enhancing an
immune response or inducing tolerance in an autoimmune
disease). For example, homologously recombinant
slow-growing mycobacteria of the present invention can
express polypeptides or proteins which are growth
inhibitors or are cytocidal for tumor cells (e.g.,
interferon α, β or γ, interleukins 1-7, tumor necrosis
factor (TNF) α or β) and, thus, are useful for treating
certain human cancers (e.g., bladder cancers, melanomas).
Homologously recombinant slow-growing mycobacteria of the
present invention are also useful vehicles to elicit
protective immunity in a host, such as a human or other
vertebrate. They can be used to produce humoral antibody
immunity, cellular immunity and/or mucosal or secretory
immunity. The antigens expressed by the homologously
recombinant slow-growing mycobacteria, useful as vaccines
or as diagnostic reagents, are also the subject of the
present invention. In addition, homologously recombinant
slow-growing mycobacteria of the present invention are
useful as vaccines in which the heterologous DNA introduced
through homologous integration is not itself expressed, but
acts to knock out a mycobacterial gene necessary for
pathogenicity of the slow-growing mycobacterium or its
growth in vivo. Such homologously recombinant slow-growing
mycobacteria are useful as vaccines to provide protection
against diseases caused by the corresponding wild-type
mycobacterium or as a vaccine vehicle which contains a
gene(s) encoding an antigen(s) of a different pathogen(s)
(e.g., as a vaccine to provide protection against an
organism other than the corresponding wild-type
mycobacterium or against a toxin or toxoid).

The vaccine of the present invention has important
advantages over presently available vaccines. For example,
mycobacteria have adjuvant properties; they stimulate a
recipient's immune system to respond to other antigens
with great effectiveness. In addition, the mycobacterium
stimulates long-term memory or immunity. This means that
a single (one time) inoculation can be used to produce
long-term sensitization to protein antigens. The result is
long-lasting T cell memory, which stimulates secondary
antibody responses that neutralize the infectious agent or
toxin. This is particularly useful, for example, against
tetanus and diphtheria toxins, pertussis, malaria,
influenza, herpes viruses and snake venoms.

BCG in particular has important advantages as a
vaccine vehicle. For example, it can be used repeatedly
in an individual and has had a very low incidence of
adverse effects. In addition, BCG, like other
mycobacteria, has a large genome (approximately 3x10^6 bp
in length). As a result, a large amount of heterologous
DNA can be accommodated within (incorporated into) the
mycobacterial genome, which means that a large gene or
multiple genes (e.g., DNA encoding antigens for more than
one pathogen) can be inserted into genomic DNA, such as by
homologous recombination.

In another embodiment of the present invention, the
method of homologous recombination is used to manipulate
the genome of a mycobacterium, such as M. tuberculosis, in
order to attenuate the organism and render it more
suitable for vaccine purposes (e.g., to render M.
tuberculosis more suitable for use as a tuberculosis
vaccine which would induce a protective response and
minimize or reduce the risk of disease to the recipient).
For example, a gene or genes encoding a product
responsible for pathogenicity of the organism can be
deleted or inactivated (e.g., by disruption). For M.
tuberculosis, such genes include, but are not limited to,
the katG gene, the Hsp60 gene, aroA, lysA, uraA and the M.
tuberculosis DNA associated with entry into and survival
inside cells. The heterologous DNA used includes DNA
homologous to genomic DNA of M. tuberculosis, which is
introduced into genomic DNA in a location which results in
deletion or disruption of the M. tuberculosis gene(s),
resulting in attenuation of the M. tuberculosis.

Brief Description of the Figures

Figure 1 is a structural and functional map of the M.
bovis BCG uraA locus, in which a restriction map of the
uraA locus and the recombinant insert DNAs for several
plasmids used to study this region are depicted. The
relative positions of the BCG uraA gene and the portions
of other genes identified are summarized graphically and
the ability of each recombinant to complement the E. coli
pyrF mutant is indicated.

Figure 2 is the nucleic acid sequence of the BCG uraA
locus (Seq ID No. 1) and the predicted protein products
(Seq ID No. 2).

Figure 3 is a schematic representation of integration
by homologous recombination in BCG. The uraA locus in
wild-type BCG (top), the transforming DNA (middle) and a
BCG transformant in which the transforming DNA fragment
has integrated via homologous recombination (bottom) are
represented.

Figure 4 is a schematic representation of the South- 
ern analysis of the BCG transformant represented in Figure 
3. 

Figure 5 shows the results of Southern blot analysis
of genomic DNA isolated from wild-type BCG (WT) and a BCG
transformant (6015-9). The positions of DNA markers are
indicated to the right and the apparent size of each of
the hybridizing DNA bands is indicated to the left.

Detailed Description of the Invention 

As described herein, Applicants have demonstrated
introduction of heterologous DNA into slow-growing
mycobacteria (transformation of heterologous DNA into
slow-growing mycobacteria) and incorporation of
heterologous DNA at a homologous locus in genomic DNA of
slow-growing mycobacteria (integration of heterologous DNA
into the genomic DNA through homologous recombination). As
a result, they have produced homologously recombinant
slow-growing mycobacteria having heterologous DNA
integrated at a homologous locus in their genomic DNA. In
particular, as described herein, Applicants have introduced
heterologous DNA into M. bovis BCG (BCG) and demonstrated
that it is present in the resulting homologously
recombinant BCG at a genomic location homologous with
sequences present in the DNA construct transformed into the
BCG. The DNA construct introduced into BCG by the method
described herein included heterologous DNA containing the
uraA locus (homologous DNA) and nonhomologous DNA (a
selectable marker gene); the heterologous homologous DNA
flanked the nonhomologous DNA in the construct. Both the
heterologous homologous DNA and the heterologous
nonhomologous DNA in the DNA construct were shown to have
integrated into the genome of the recipient mycobacterial
cells at a homologous locus (a genomic locus homologous
with the DNA including and/or flanked by the homologous DNA
in the DNA construct).

As a result of the work described herein, a method of
transforming slow growing mycobacteria, a method of
introducing heterologous DNA into genomic DNA of a slow
growing mycobacterium through integration at a homologous
locus, DNA constructs useful in the method of introducing
heterologous DNA into a homologous locus in genomic DNA of
a slow growing mycobacterium, homologously recombinant slow
growing mycobacteria which contain heterologous DNA at a
homologous locus in their genomic DNA, a BCG gene encoding
orotidine-5'-monophosphate decarboxylase (BCG OMP DCase)
and homologously recombinant slow growing mycobacteria
useful as vaccines are available. The following is a
description of the present method, DNA constructs and
vaccines, as well as the isolated BCG OMP DCase gene and
its use.

The present invention includes an improved method of
transforming slow growing mycobacteria. In the present
method, slow growing mycobacteria are subjected to
electroporation in water, preferably after exposure to
(culturing in the presence of) glycine and preferably also
while they are in mid-log growth. Slow growing mycobacteria
to be transformed with heterologous DNA are combined with
the heterologous DNA (which can be plasmid/circular DNA or
linear DNA) in water. The resulting combination is
subjected to electroporation under conditions (e.g.,
potential, capacitance and time) sufficient for entry of
the heterologous DNA into the slow growing mycobacteria.
Electroporation is carried out at approximately 2 to 2.5 kV
potential and approximately 1 to 125 µF capacitance for
approximately 4 to 40 milliseconds. In a specific
embodiment, slow growing mycobacterial cells are
electroporated in water at approximately 2.5 kV potential
and approximately 25 µF capacitance for 5-6 milliseconds.
In a further embodiment, slow growing mycobacteria to be
transformed are exposed to glycine (e.g., 1 to 2% glycine)
by addition of glycine to culture medium prior to harvest
of the cells. In a particular embodiment, slow growing
mycobacteria are exposed to 1.5% glycine, which is added to
culture medium, for approximately 24 hours prior to harvest
of the cells for transformation. In another embodiment,
slow-growing mycobacteria are in mid-log growth when they
are transformed. The cells can also have been exposed to
glycine, as described above, prior to electroporation,
although that is not necessary. The mid-log slow growing
mycobacteria are combined with heterologous DNA to be
introduced into them and subjected to electroporation in
water, as described above, resulting in transformation of
the heterologous DNA into slow growing mycobacteria in the
combination.
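
By way of illustration only, the settings recited above can be
checked with a short calculation. The following Python sketch
treats the approximate ranges (2 to 2.5 kV, 1 to 125 µF, 4 to 40
milliseconds) as hard bounds, which is a simplification. It also
assumes, although the description does not say so, that the pulse
is an exponential-decay capacitor discharge whose stated duration
corresponds to the time constant tau = R x C. The names
ElectroporationSettings, within_described_ranges and
implied_resistance_ohms are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ElectroporationSettings:
    potential_kv: float    # pulse potential, kilovolts
    capacitance_uf: float  # capacitance, microfarads
    pulse_ms: float        # pulse duration, milliseconds


def within_described_ranges(s: ElectroporationSettings) -> bool:
    """True if the settings fall inside the approximate ranges in the text.

    The text says "approximately"; hard bounds are used here only
    to keep the illustration simple.
    """
    return (2.0 <= s.potential_kv <= 2.5
            and 1.0 <= s.capacitance_uf <= 125.0
            and 4.0 <= s.pulse_ms <= 40.0)


def implied_resistance_ohms(capacitance_uf: float, pulse_ms: float) -> float:
    """Resistance implied by tau = R * C.

    Assumption (not from the text): the pulse duration is the RC
    time constant of an exponential-decay discharge.
    """
    return (pulse_ms * 1e-3) / (capacitance_uf * 1e-6)


# Specific embodiment from the text: ~2.5 kV, ~25 uF, 5-6 ms.
specific = ElectroporationSettings(potential_kv=2.5,
                                   capacitance_uf=25.0,
                                   pulse_ms=5.5)
print(within_described_ranges(specific))
print(implied_resistance_ohms(25.0, 5.0), implied_resistance_ohms(25.0, 6.0))
```

Under the stated assumption, 25 µF and 5-6 milliseconds would
correspond to a circuit resistance of roughly 200 to 240 ohms.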

The heterologous DNA introduced into slow growing
mycobacteria by the present method is DNA obtained from
any source other than the mycobacterium into which it is
being introduced. It can be of viral, bacterial,
mycobacterial, invertebrate or vertebrate (including human
and other mammalian) origin, can be obtained from other
organisms, such as parasites, or can be produced to have
the same nucleic acid sequence as the DNA in its naturally
occurring source. Alternatively, it can be modified DNA.
The DNA introduced can be plasmid (circular) DNA or linear
DNA. The heterologous DNA contains DNA homologous to a
locus in genomic DNA of the recipient slow growing
mycobacteria, DNA nonhomologous to a locus in genomic DNA
of the recipient cells or both. It is possible to combine
slow growing mycobacteria and a DNA construct in which the
heterologous DNA is only nonhomologous DNA and carry out
the present method of transformation, if the goal is to
transform slow growing mycobacteria with greater
efficiency than is possible with existing methods.
Heterologous DNA introduced in this manner will integrate
randomly into genomic DNA.

In order to produce homologously recombinant slow
growing mycobacteria through homologous integration between
mycobacterial genomic DNA and heterologous DNA, the DNA
construct must include sufficient DNA homologous with
mycobacterial DNA to cause integration of the construct
into a homologous genomic locus. If only homologous DNA
is present in the DNA construct used (e.g., in a construct
introduced in order to knock out or activate endogenous
mycobacterial DNA), at least 400 bp of homologous DNA will
generally be used. If the DNA construct includes
homologous DNA (for directing or targeting introduction
into mycobacterial genomic DNA) and nonhomologous DNA
(e.g., DNA encoding a product to be expressed in
homologously recombinant slow growing mycobacteria), there
is homologous DNA on both sides of (flanking both ends of)
the nonhomologous DNA. In general, there will be at least
approximately 250 bp of homologous DNA on each side of the
nonhomologous DNA, although shorter flanking homologous
sequences can be used, provided that they are of
sufficient length to undergo homologous recombination with
genomic sequences, resulting in their introduction into
mycobacterial genomic DNA (alone or in conjunction with
nonhomologous DNA with which the homologous DNA is present
in the DNA construct). In the embodiment described in the
examples, 1.5 kb of homologous DNA (1.5 kb of uraA
flanking sequence) has been shown to result in homologous
integration, along with nonhomologous DNA, into the uraA
locus of M. bovis BCG.
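
The length guidelines above can be expressed as a simple check.
The following Python sketch is illustrative only: the function
check_construct and its parameter names are hypothetical, the
thresholds are the general figures given in the text (at least
400 bp when only homologous DNA is present; at least
approximately 250 bp on each side of nonhomologous DNA), and the
1200 bp insert length in the usage lines is a placeholder rather
than a value from the description. Because the text allows
shorter flanks where they still recombine, a short flank is
reported as a note, not as an error.

```python
from typing import List

MIN_HOMOLOGY_ONLY_BP = 400  # construct containing only homologous DNA
MIN_FLANK_BP = 250          # each side of nonhomologous DNA (general guideline)


def check_construct(left_flank_bp: int,
                    nonhomologous_bp: int,
                    right_flank_bp: int) -> List[str]:
    """Return notes on how a construct compares with the general guidelines.

    An empty list means the construct meets the figures quoted in the text.
    """
    notes: List[str] = []
    if nonhomologous_bp == 0:
        # Only homologous DNA is present (e.g., a knock-out construct).
        total = left_flank_bp + right_flank_bp
        if total < MIN_HOMOLOGY_ONLY_BP:
            notes.append(f"only {total} bp of homologous DNA; "
                         f"generally at least {MIN_HOMOLOGY_ONLY_BP} bp is used")
        return notes
    for side, length in (("left", left_flank_bp), ("right", right_flank_bp)):
        if length == 0:
            notes.append(f"no homologous DNA on the {side} side; "
                         "nonhomologous DNA must be flanked on both sides")
        elif length < MIN_FLANK_BP:
            notes.append(f"{side} flank is {length} bp; generally at least "
                         f"approximately {MIN_FLANK_BP} bp is used")
    return notes


# Construct from the examples: 1.5 kb of uraA flanking sequence per side.
print(check_construct(1500, 1200, 1500))
# A construct with one short flank.
print(check_construct(100, 1200, 1500))
```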

The homologous DNA present in the DNA construct can
be any DNA homologous to DNA present in genomic DNA of the
recipient slow growing mycobacterium. Specifically
described herein is the isolation and sequencing of the M.
bovis BCG OMP DCase gene (uraA) and its use to introduce
heterologous nonhomologous DNA into M. bovis BCG genomic
DNA at a homologous locus. As described in the examples,
a BCG DNA fragment which included the OMP DCase coding
sequence was modified to remove the OMP DCase coding
sequence and replace it with heterologous nonhomologous
DNA encoding a selectable marker gene (i.e., the Kan^r
(aph) gene). Specifically, the DNA construct was made by
removing the OMP DCase coding sequence from a 4.4 kb BCG
DNA fragment containing uraA and replacing it with the
Kan^r gene (aph), to produce a DNA fragment in which the
selectable marker gene is flanked by 1.5 kb uraA DNA (to
direct homologous recombination or integration of the
homologous DNA and, along with it, the nonhomologous DNA
into mycobacterial genomic DNA). All or a portion of the
OMP DCase gene can be used, with similar modifications, as
a component of a DNA construct including other
heterologous nonhomologous DNA to be introduced into M.
bovis BCG genomic DNA at the uraA locus. Alternatively,
other M. bovis BCG genes can be used as the heterologous
homologous component of a DNA construct useful for
introducing heterologous nonhomologous DNA into the
mycobacterium. Similarly, DNA from other slow growing
mycobacteria (e.g., M. leprae, M. tuberculosis, M. avium,
M. africanum) can be incorporated into a DNA construct to
be used for homologous recombination in the respective slow
growing mycobacteria.
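
The gene replacement described above (a coding sequence removed
and a marker gene left in its place, between two flanking
homologous sequences) can be pictured with a toy string model.
The following Python sketch is illustrative only and is not a
model of the biology of recombination: the function
integrate_by_homology is hypothetical, the short sequences are
invented placeholders rather than uraA or aph sequence, and
exact string matching stands in for homology.

```python
from typing import Optional


def integrate_by_homology(genome: str,
                          left_flank: str,
                          insert: str,
                          right_flank: str) -> Optional[str]:
    """Replace whatever lies between the two flanks in `genome` with `insert`.

    Returns None when either flank cannot be found, i.e. when there is
    no homologous locus for the construct to target.
    """
    start = genome.find(left_flank)
    if start == -1:
        return None
    left_end = start + len(left_flank)
    # The right flank must lie downstream of the left flank.
    right_start = genome.find(right_flank, left_end)
    if right_start == -1:
        return None
    return genome[:left_end] + insert + genome[right_start:]


# Placeholder sequences: flank / target coding sequence / flank.
wild_type = "GGCC" + "ATGCATGC" + "TTTTTT" + "CGTACGTA" + "AATT"
recombinant = integrate_by_homology(wild_type,
                                    left_flank="ATGCATGC",
                                    insert="AAAGGG",   # stands in for a marker gene
                                    right_flank="CGTACGTA")
print(wild_type)
print(recombinant)
```

In this model the flanks are unchanged in the product and only
the sequence between them differs, which mirrors the arrangement
in which the selectable marker gene is flanked by uraA DNA.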

WO 95/03417                                  PCT/US94/08267

The heterologous nonhomologous DNA in the DNA
construct introduced into slow growing mycobacteria by the
present method can be any DNA which is expressed in the
slow-growing mycobacteria or which is not expressed in the
recipient mycobacteria but alters mycobacterial protein
expression or function. For example, the heterologous
nonhomologous DNA can be DNA encoding an antigen(s) of a
pathogen or pathogens. A pathogen is any virus,
microorganism, other organism or substance (e.g., toxins,
toxoids) which causes a disease or undesirable condition.
Homologously recombinant slow growing mycobacteria which
express a protein antigen(s) from malaria sporozoites,
malaria merozoites, diphtheria toxoid, tetanus toxoid,
Leishmania, Salmonella, M. africanum, M. intracellulare,
M. avium, Treponema, pertussis, herpes virus, measles
virus, mumps, Shigella, Neisseria, Borrelia, rabies,
poliovirus, human immunodeficiency virus (HIV), simian
immunodeficiency virus (SIV), snake venom, insect venom or
Vibrio cholerae can be produced using the method of the
present invention. Homologously recombinant M. bovis BCG,
which, in a nonhomologously recombinant form, has long
been successfully administered as a vaccine in humans, can
be used. The DNA encoding the protein antigen(s) can be
obtained from sources in which it naturally occurs or can
be produced through known recombinant techniques or known
chemical synthetic methods. For example, the DNA can be
produced by genetic engineering methods, such as cloning
or by the polymerase chain reaction (PCR).

A multipurpose or multifunctional vaccine (one which
contains and expresses heterologous DNA encoding antigens
from more than one pathogen) can be produced by the
present method. In this embodiment, one or more DNA
constructs are used to introduce heterologous homologous
DNA and heterologous nonhomologous DNA (DNA encoding an
antigen against which protection is desired) into the slow
growing mycobacterium. If one construct is used, it
includes DNA encoding the antigens of interest, flanked by
homologous DNA sufficient for introduction of the
heterologous DNA into a homologous locus in the
mycobacterium. More than one construct can be used; in
this case, each includes homologous DNA and nonhomologous
DNA encoding an antigen of interest. A multifunctional
vaccine of the present invention can be homologously
recombinant BCG which contains, within its genomic DNA, a
gene encoding an antigen for M. leprae, a gene encoding an
antigen for M. tuberculosis, a gene encoding an antigen
for malaria and a gene encoding an antigen for Leishmania;
these sequences are flanked by heterologous sequences
homologous with BCG DNA and are introduced into the BCG
genome by homologous integration.

It is not necessary that heterologous nonhomologous
DNA be expressed by homologously recombinant slow growing
mycobacteria of the present invention or even that there
be heterologous nonhomologous DNA present. For example,
in one embodiment, heterologous nonhomologous DNA is
incorporated into genomic DNA of slow growing mycobacteria
for the purpose of inactivating an endogenous mycobacterial
gene, such as a gene necessary for the pathogenicity of the
mycobacterium. Any gene involved in metabolism necessary
for pathogenicity of the slow growing mycobacterium (or for
its growth in humans or other animals) but whose absence
(e.g., from being knocked out) does not prevent it from
being cultured can be targeted for inactivation. For
example, the aroA gene of M. tuberculosis can be
inactivated. In another embodiment, heterologous
nonhomologous DNA is introduced in order to activate or
turn on an endogenous mycobacterial gene. In either case,
the heterologous nonhomologous DNA need not be expressed.

Inactivation by disruption or deletion of a gene or
genes associated with pathogenicity of M. tuberculosis can
be carried out to produce attenuated M. tuberculosis which
is useful as a vaccine. There are a number of
M. tuberculosis genes with interesting functions that can
be knocked out (deleted) or disrupted to produce attenuated
M. tuberculosis. These include katG (Heym, B. et al.,
J. Bacteriol. 175:4255-4259 (1993); Zhang, Y. et al.,
Nature 358:591-593 (1992)), the "spare" mycobacterial Hsp60
gene (Hong, T.H. et al., Proc. Natl. Acad. Sci. USA
90:2608-2612 (1993)), aroA (Garbe, T. et al.,
J. Bacteriol. 172:6774-6782 (1990)), lysA (Andersen, A.B.
et al., Gene 124:105-109 (1993)), uraA (Aldovini, A.
et al., J. Bacteriol. 175:7282-7289 (1993)) and the
M. tuberculosis DNA associated with entry and survival
inside cells (Arruda, S. et al., Science 261:1454-1457
(1993)). This is not a complete list of candidate genes,
but is provided as an example of the types of genes that
can be targeted.

The katG gene is useful because 1) it is known that
katG can be deleted without creating a lethal mutation and
2) deletion will permit selection for isoniazid resistance,
allowing for a quick selection for recombinants that have
undergone homologous recombination at the legitimate
locus.

M. tuberculosis expresses two chaperonin-60 homologs
(Hong, T.H. et al., Proc. Natl. Acad. Sci. USA
90:2608-2612 (1993)), only one of which appears to be a
major target of the immune response (the 65 kDa antigen),
and it is thought that the "65 kDa antigen" may contribute
to pathogenesis. Knockout mutations in each of these two
genes can be made to investigate whether they are lethal.
Heterologous DNA is introduced into M. tuberculosis by the
present method by combining the M. tuberculosis and
heterologous DNA whose introduction into M. tuberculosis
genomic DNA disrupts DNA associated with pathogenesis of
the organism, thus resulting in attenuation of the
pathogenicity of the recipient M. tuberculosis. The
heterologous DNA can be DNA homologous to genomic
M. tuberculosis DNA or DNA homologous to genomic
M. tuberculosis DNA and DNA which is not homologous to
genomic M. tuberculosis DNA.

Heterologous DNA can be homologous DNA only; it is
not necessary that heterologous nonhomologous DNA be
present. For example, homologous DNA can be introduced into
an endogenous mycobacterial gene (such as one essential for
the pathogenicity of a slow growing mycobacterium) in
order to disrupt or inactivate that gene. This is
particularly useful in those embodiments in which an
attenuated or disabled mycobacterium is desired, such as
for use as a vaccine to elicit an immune response against
the mycobacterium itself or as a vehicle to be used in a
similar manner to that in which homologously recombinant
BCG can be used (to express antigens of other pathogens).

Homologously recombinant slow growing mycobacteria of
the present invention can be administered by known methods
and a variety of routes (e.g., intradermally,
intramuscularly, intravenously). They are useful as
vehicles in which the heterologous nonhomologous DNA is
expressed and as modified slow growing mycobacteria (e.g.,
mycobacteria with reduced or abolished pathogenicity) which
are disabled or attenuated and, thus, useful as vaccines.

The present invention will now be illustrated by the
following examples, which are not to be considered limiting
in any way.

MATERIALS AND METHODS

Strains and plasmids. M. bovis BCG used for DNA
isolation and subsequent construction of the recombinant
BCG plasmid and λgt11 libraries was the Montreal Strain,
ATCC #35735. M. bovis BCG was grown in Middlebrook 7H9
media, supplemented with 0.05% Tween 80, as described in
Aldovini and Young, Nature 351:479-482 (1991). E. coli
strain Y1107 (pyrF::Mu trpam lacZam hsdR- m+ su-) was
obtained from D. Botstein. Plasmids were propagated in
the E. coli strain DH5α from Bethesda Research
Laboratories. E. coli cultures used for plasmid selection
were grown in Luria Bertani broth or agar with 50 μg/ml
ampicillin. Phage M13 used for the production of single
stranded DNA were propagated in E. coli strain JM101 from
New England BioLabs. JM101 was grown in YT medium
(Maniatis). Genomic libraries were generated using pUC19
from Bethesda Research Laboratories. Plasmid pY6002 (Husson
et al., J. Bacteriol. 172:519-524, 1990) was the source of
the 1.3 kb BamHI DNA fragment containing the aminoglycoside
phosphotransferase gene aph.

Enzymes. Klenow fragment of E. coli DNA polymerase
was supplied by Promega. T7 polymerase and Taq polymerase
(Sequenase and Taquence) were provided by United
States Biochemical.

Recombinant DNA library construction. To isolate BCG
DNA, cells were harvested by centrifugation, washed, and
resuspended in 50 mM Tris (pH 8.0), 10 mM EDTA, 10%
sucrose, and 0.5 mg/ml lysozyme, and incubated at 37 degrees
for one hour. EDTA was then added to 1%, and the mixture
was incubated at room temperature for 15 minutes. Three
phenol/chloroform extractions were performed, followed by
RNase treatment, phenol/chloroform extraction, chloroform
extraction and ethanol precipitation. The DNA was then
resuspended in TE buffer (10 mM Tris pH 7.5, 1 mM EDTA).

To construct the plasmid library, the DNA was
subjected to partial digestion with Sau3A and DNA fragments
of 2-6 kb were isolated by agarose gel electrophoresis
onto DE81 paper and eluted in buffer containing 10 mM
Tris-HCl, 1 M NaCl and 1 mM EDTA. The DNA fragments were
then phenol-chloroform extracted, ethanol precipitated and
ligated into BamHI-digested, calf intestinal
phosphatase-treated pUC19 plasmid vector. E. coli cells
were transformed with the ligated mixture, and
approximately 4 X 10^ recombinants were obtained. Plasmid
DNA was obtained from the pool of transformed colonies
using an alkaline lysis method.
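
The digestion and size-selection steps described above can be pictured computationally. The following is a minimal sketch, not the laboratory procedure: it assumes a linear input sequence, models a complete rather than a partial digest, and relies on the known Sau3A recognition site GATC, cut immediately 5' of the G. The function names `sau3a_fragments` and `size_select` and the toy input are hypothetical and chosen only for illustration.

```python
def sau3a_fragments(seq: str) -> list[int]:
    """Fragment lengths from a complete Sau3A digest of a linear sequence.

    Sau3A recognizes GATC and cuts immediately 5' of the G, so every
    fragment after the first begins with GATC.
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find("GATC")
    while pos != -1:
        cuts.append(pos)
        pos = seq.find("GATC", pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Drop the zero-length piece that arises if the sequence starts with GATC.
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


def size_select(lengths: list[int], low: int = 2000, high: int = 6000) -> list[int]:
    """Keep fragment lengths inside the 2-6 kb window used for the library."""
    return [n for n in lengths if low <= n <= high]


if __name__ == "__main__":
    toy = "AAGATCTTGATCAA"  # hypothetical toy sequence with two GATC sites
    print(sau3a_fragments(toy))
    print(size_select([1500, 2000, 4200, 6000, 7500]))
```

Because the library was built from a partial digest, real inserts would span several adjacent complete-digest fragments; the sketch only shows where the GATC ends, which are compatible with BamHI-cut vector, arise.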

The λgt11 library was constructed using a procedure
described by Young (Young, R.A., et al., Proc. Natl.
Acad. Sci. USA 82:2583-2587 (1985)). Briefly, BCG
genomic DNA was subjected to random partial digestion with
DNase I, EcoRI linkers were added to the digestion
products, and DNA fragments of 4-8 kb were isolated by
agarose gel electrophoresis and electroelution. The DNA
fragments were then ethanol precipitated and ligated into
EcoRI-digested λgt11 arms. The ligation mixture was
packaged into λ heads and the packaging mixture was used to
infect E. coli. Approximately 5 X 10^ recombinants were
obtained.

EXAMPLE 1

Isolation of BCG OMP DCase gene by complementation and
plasmid DNA manipulation. The BCG recombinant library was
used to transform the E. coli strain Y1107. Twenty-one
transformants capable of growing in the absence of uracil
were isolated, of which six were chosen for further
evaluation by restriction analysis. Plasmid DNA was
isolated by alkaline lysis from cells grown in liquid
culture, and restriction analysis indicated that all of
these plasmids contained the same or very similar insert
DNAs. One of these clones (pY6006) was used for further
study (see Figure 1). A 0.6 kb BamHI DNA fragment from
pY6006 was used to screen the λgt11 library, leading to the
isolation of phage Y3030. This phage carries a 5.6 kb EcoRI
BCG DNA insert containing the OMP DCase gene. This insert
DNA was subcloned into pGEMz(f+) to generate pY6011. The
4.4 kb SacI-EcoRI fragment of the Y3030 insert was
subcloned into pUC19 to generate pY6014. Plasmid pY6015 was
derived from pY6014 by replacing uraA sequences with the
aph gene; a 1.15 kb HincII DNA fragment containing uraA
sequences was removed by partial HincII digestion of pY6014
DNA, and it was replaced with a 1.3 kb BamHI fragment
containing aph from pY6002 that was blunt-ended with
Klenow.

DNA sequence analysis. The M. bovis BCG uraA gene
was sequenced from the 4.4 kb SacI-EcoRI fragment of the
λgt11 phage Y3030 cloned into M13 in both orientations.
The same DNA fragment was subcloned into pUC19 to generate
pY6014 for further manipulation. Single strand DNA for
sequence analysis was prepared from M13 grown in JM101
(Vieira and Messing, Methods Enzymol. 153:3-11, 1987).
Both DNA strands were sequenced using the dideoxy method
of Sanger (Sanger et al., Proc. Natl. Acad. Sci. USA
74:5463-5467, 1977). Mycobacterial DNA has a high GC
content, and two different strategies were used to reduce
band compression and other artifacts due to high G+C
content. A subset of the reactions was carried out using
Taq polymerase at high temperature (70°C). In addition,
dGTP and dITP were used in independent sequence reactions
(Kimsey and Kaiser, J. Biol. Chem. 267:819-824, 1992).
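
The high G+C content mentioned above is easy to quantify for any stretch of sequence. The sketch below is illustrative only: the function name `gc_content` is hypothetical, and the example input is the first 60 nucleotides of SEQ ID NO:1 as printed in the sequence listing, joined into one string.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T) of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # First 60 nt of SEQ ID NO:1 (six blocks of ten, spaces removed).
    first_60 = (
        "GAGCTCGACC" "CCGCCGCCGA" "AACAGAGGTG"
        "GCCCCGCAGA" "CCGAAAGGCC" "CAAGGTGCTG"
    )
    print(f"G+C fraction of the first 60 nt: {gc_content(first_60):.2f}")
```

A single 60-nucleotide window is not representative of a genome; the value is shown only to make the band-compression concern concrete.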

RESULTS

Isolation of the BCG OMP decarboxylase gene by genetic
complementation. The complementation strategy employed to
isolate the BCG OMP DCase gene was similar to that
employed previously to isolate the homologous gene in
M. smegmatis (Husson et al., J. Bacteriol. 172:519-524,
1990). A recombinant library was constructed in the
E. coli vector pUC19 using size selected BCG genomic DNA
fragments from a partial Sau3A digest. An E. coli pyrF
mutant strain (Y1107) was transformed with this library
and cells were plated on medium lacking uracil to select



WO 95/03417    PCT/US94/08267



for uracil prototrophs, and on rich medium containing
ampicillin to ascertain the transformation frequency and
to estimate the fraction of transformants that were able
to complement the E. coli pyrF defect. Approximately
0.05% of the cells transformed with the recombinant
library became uracil prototrophs. DNA clones were obtained
from six colonies able to grow in the absence of uracil,
and restriction analysis revealed that these clones
contained the same insert DNA. One of these clones, pY6006,
was subjected to further study (Figure 1).

To identify the portion of the 3.5 kb insert DNA of
pY6006 that was responsible for complementation, the
1.3 kb BamHI fragment of Tn903, which encodes
aminoglycoside transferase (aph), was inserted into several
different sites in pY6006 insert DNA, the resultant
plasmids were reintroduced into the E. coli pyrF mutant
strain, and the ability of the new plasmids to complement
the mutant phenotype was assessed as before (Figure 1). One
of the three plasmids with insertion mutations, pY6006B,
lost the ability to complement the pyrF mutant phenotype,
suggesting that sequences necessary for the complementing
activity are located in the vicinity of the BamHI site
that is disrupted in pY6006B.

Analysis of DNA sequences for the left end of pY6006
insert DNA (as diagrammed in Figure 1) revealed that the
open reading frame of the pUC19 lacZ gene in this plasmid
continues uninterrupted into an open reading frame for a
polypeptide similar in sequence to OMP decarboxylase
proteins. These preliminary data suggested that the left
end of pY6006 insert DNA encoded the amino-terminus of the
BCG OMP decarboxylase protein.

For later experiments, it was important to have both
the OMP decarboxylase gene and a substantial amount of
flanking sequences. To obtain genomic DNA that contains
both the OMP decarboxylase gene and its flanking
sequences, the 0.6 kb BamHI DNA fragment from pY6006 was
used to probe a λgt11 library of M. bovis BCG DNA, as the
λgt11 library contains insert DNA fragments whose size, on
average, is larger (4-8 kb) than the plasmid library used
to obtain pY6006. A lambda clone (Y3030) was isolated
which contains a 5.6 kb EcoRI DNA insert that overlaps
that of pY6006. The 5.6 kb EcoRI DNA fragment, and a 4.4
kb SacI-EcoRI subfragment, were subcloned into plasmid
vectors to generate pY6011 and pY6014, respectively
(Figure 1). Both pY6011 and pY6014 were able to complement
the defect of the E. coli pyrF mutant strain Y1107.

Sequence of the BCG OMP decarboxylase gene and flanking
DNA. DNA fragments from phage Y3030 insert DNA were
subcloned into M13 vectors and subjected to sequence
analysis. Sequences were determined for both DNA strands,
and most of the sequence reactions were duplicated with
ITP replacing GTP to minimize artifacts due to the GC-rich
nature of mycobacterial DNA. Figure 2 shows the sequences
obtained for the BCG OMP decarboxylase gene (uraA)
(nucleotides 1691-2512) and for flanking DNA. The predicted
BCG OMP decarboxylase protein sequence is 274 amino acids
long, similar in size to other OMP decarboxylase proteins.
The 274 amino acids of the predicted BCG OMP decarboxylase
protein are bracketed in Figure 2. When the BCG
decarboxylase protein sequence was used to screen the
available databases for similar sequences, the results
revealed that the BCG protein is closely related to the
Myxococcus xanthus OMP DCase (Kimsey and Kaiser, J. Biol.
Chem. 267:819-824, 1992) and more distantly related to the
other known prokaryotic and eukaryotic OMP DCases.
Comparison of the BCG and M. xanthus OMP decarboxylases
reveals that 40% of the amino acid residues are identical.
In contrast, only 17% of the residues of the BCG and
E. coli proteins and 22% of the amino acids of the
M. xanthus and E. coli proteins are identical, although
there are a substantial number of conservative amino acid
substitutions among these proteins. The relationship of
M. xanthus OMP decarboxylase to homologues in other
prokaryotes and in eukaryotes was recently described in
some detail (Kimsey and Kaiser, J. Biol. Chem. 267:819-824,
1992). This comparative sequence analysis revealed that
there are four regions which are more highly conserved, and
the predicted BCG OMP decarboxylase also shares this
feature with the other homologues. It is interesting to
note that Mycobacteria and Myxococci both have GC-rich
genomes, but this alone does not account for the degree of
sequence conservation between the OMP decarboxylases from
these two prokaryotes; rather, the two genera appear to be
more closely related to one another than either is to the
other prokaryotes for which OMP decarboxylase sequences are
available.
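
The percent-identity figures quoted above (40%, 17% and 22%) come from pairwise comparisons of aligned protein sequences. The text does not state how the alignments were scored, so the following is only a sketch of one common convention: identical residues divided by aligned (non-gap) columns. The function name `percent_identity` and the toy alignment are hypothetical; they are not taken from the OMP decarboxylase sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # gap columns are excluded from the denominator
        compared += 1
        if x == y:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Hypothetical toy alignment: 5 ungapped columns, 4 of them identical.
    print(percent_identity("MKV-LA", "MRVALA"))
```

Other conventions (for example, dividing by the length of the shorter sequence) give somewhat different numbers, which is worth keeping in mind when comparing identity values across reports.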

Further analysis of the BCG genomic DNA sequences
revealed that the 1.7 kb sequence upstream of OMP
decarboxylase coding sequences contains a single large open
reading frame. This open reading frame has no apparent
beginning in the cloned DNA fragment, suggesting that it
is the coding sequence for the carboxy-terminus of a
larger protein. A screen of the sequence database
revealed that the 497 amino acid residues of the predicted
protein are highly homologous to the carboxyl termini of
the large subunit of carbamoyl phosphate synthase. For
example, the 497 amino acid carboxy terminus of the
putative M. bovis BCG protein was 46% identical to the
comparable segment of the E. coli carbamoyl phosphate
synthase subunit, which is encoded by the carB gene
(Nyunoya and Lusty, Proc. Natl. Acad. Sci. USA
80:4629-4633, 1983). Thus, the BCG carB gene appears to be
located just upstream of uraA. This is interesting because
both carbamoyl phosphate synthase and OMP decarboxylase are
involved in pyrimidine biosynthesis. Carbamoyl phosphate
synthase catalyzes the first reaction in pyrimidine
biosynthesis, the production of carbamoyl phosphate, while
OMP decarboxylase catalyzes the last step in the
biosynthesis of UMP.
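
The coordinates and protein lengths reported above can be cross-checked with simple arithmetic. This sketch assumes 1-based, inclusive nucleotide coordinates, which the text does not state explicitly; the helper name `codon_count` is hypothetical.

```python
def codon_count(start: int, end: int) -> int:
    """Number of codons in an interval given as 1-based inclusive coordinates."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("interval is empty or not a whole number of codons")
    return length // 3


# uraA is reported at nucleotides 1691-2512 of the sequenced fragment.
URAA_CODONS = codon_count(1691, 2512)

# Sequence available upstream of uraA, and the coding length needed for the
# 497-residue carboxy-terminal portion of the predicted carB product.
UPSTREAM_NT = 1691 - 1
CARB_PARTIAL_NT = 497 * 3

if __name__ == "__main__":
    print(f"uraA interval: {URAA_CODONS} codons")
    print(f"upstream region: {UPSTREAM_NT} nt; 497 residues need {CARB_PARTIAL_NT} nt")
```

Under these assumptions the uraA interval is 822 nucleotides, or 274 codons, matching the 274 amino acid prediction, and the 1,491 nucleotides needed for 497 residues fit within the roughly 1.7 kb upstream region.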

Analysis of BCG DNA sequences downstream of the uraA
gene revealed a single large open reading frame that
continues through the right end of the sequenced DNA
fragment. This open reading frame predicts a protein of
501 amino acids. A search of the computer database
revealed that the protein predicted by this ORF is similar
to previously described proteins from M. tuberculosis and
M. leprae. The predicted BCG protein is similar to a
putative M. tuberculosis antigen encoded downstream of
the gene for the 65 kDa antigen (Shinnick, T.M.,
J. Bacteriol. 169:1080-1088, 1987) and to an M. leprae
antigen that may be an integral membrane protein
(Vega-Lopez et al., Infect. Immun. 61:2145-2154, 1993).

Southern analysis with whole genomic DNA revealed
that there is a single copy of the uraA gene and flanking
DNA in the BCG genome (see below). The relative positions
of the BCG uraA gene and the portions of other genes
identified through sequence analysis are summarized
graphically in Figure 1. The position of OMP decarboxylase
sequences is consistent with the genetic analysis
described above. The aph insertion mutations in plasmid
pY6006 that adversely affected complementation of the
E. coli OMP decarboxylase mutant occurred within OMP
decarboxylase coding sequences. Conversely, the aph
insertion mutations that did not affect complementation of
the E. coli OMP decarboxylase mutant occurred outside of
the BCG OMP decarboxylase coding sequences.



EXAMPLE 2

BCG transformation. BCG Pasteur (ATCC) was grown in log
phase to an OD600 ^-^ Middlebrook medium. BCG cells
were harvested by centrifugation and washed twice with PBS
(phosphate buffered saline) and resuspended in 1 mM MgCl2
(pH 7.2), 10% sucrose, 15% glycerol at a concentration of
10 OD600 per ml. 0.4 ml of BCG cells was mixed with 2 μg of
plasmid DNA and electroporated in a 0.2 cm cuvette.
Electroporation settings were 2.5 kV potential and 25 μF
capacitance. After electroporation, cells were
resuspended in 10 ml Middlebrook medium and incubated at
37°C for 2 hours before plating on Middlebrook agar
containing 20 μg/ml kanamycin and, in some experiments,
with uracil.
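
The electroporation settings above imply a nominal field strength and DNA concentration that are often quoted when comparing protocols. The sketch below simply divides the stated quantities; the function names are hypothetical, and the DNA concentration ignores the (unstated) volume of the DNA solution itself.

```python
def field_strength_kv_per_cm(voltage_kv: float, gap_cm: float) -> float:
    """Nominal electric field across the cuvette gap, in kV/cm."""
    if gap_cm <= 0:
        raise ValueError("cuvette gap must be positive")
    return voltage_kv / gap_cm


def dna_ug_per_ml(dna_ug: float, cell_volume_ml: float) -> float:
    """DNA concentration relative to the cell suspension volume, in ug/ml."""
    if cell_volume_ml <= 0:
        raise ValueError("volume must be positive")
    return dna_ug / cell_volume_ml


if __name__ == "__main__":
    # Values from the protocol: 2.5 kV, 0.2 cm cuvette, 2 ug DNA in 0.4 ml cells.
    print(f"{field_strength_kv_per_cm(2.5, 0.2):.1f} kV/cm")
    print(f"{dna_ug_per_ml(2.0, 0.4):.1f} ug DNA per ml of cells")
```

With the stated values this gives a nominal field of 12.5 kV/cm and about 5 μg of DNA per ml of cell suspension.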

Southern blot analysis. Genomic DNAs from BCG
strains were isolated as described above, digested with
restriction enzymes, subjected to agarose gel
electrophoresis in the presence of ethidium bromide,
transferred to nitrocellulose, and probed with DNA labelled
with 32P by random priming, all by standard procedures
(Ausubel et al., Current Protocols in Molecular Biology
(1987), Greene Publishing Associates and Wiley
Interscience).

Introduction of foreign DNA into the BCG genome.
Previous attempts to obtain homologous recombination in
M. bovis BCG have apparently not been successful (Kalpana
et al., Proc. Natl. Acad. Sci. USA 88:5433-5447, 1991;
Young and Cole, J. Bacteriol. 175:1-6, 1993). It is
possible that the efficiency of transformation has an
influence on the ability to obtain homologous
recombination. To maximize the transformation efficiency of
BCG, we investigated the effect of adding glycine to the
culture medium prior to harvesting cells for
electroporation, as the presence of 1.5% glycine can affect
the integrity of the cell wall and it seems to improve
transformation efficiency in M. smegmatis (Mizuguchi and
Takunaga, "Spheroplasts of Mycobacteria. 2. Infection of
Phage and Its DNA on Glycine Treated Mycobacteria and
Spheroplasts", Med. Biol. 77:57, 1968). In addition, we
compared the efficiency of electroporation of BCG cells in
water relative to buffer. The autonomously replicating
plasmid pYUB12 (Snapper et al., Mol. Microbiol.
4:1911-1919, 1988) was used to determine how these
variables affected the relative efficiencies of
transformation. The results are summarized in the Table
under Experiment 1. Transformation efficiencies were
improved substantially by exposing cultures to 1.5%
glycine for 24 hours prior to harvest, and by performing
the electroporation in water rather than in buffer.

TABLE. BCG Transformation Efficiencies



Transforming  Glycine       Electroporation  Transformants/μg DNA
DNA(a)        Treatment(b)  Medium(c)        Expt 1  Expt 2  Expt 3

pYUB12        -             Buffer           50
pYUB12        +             Buffer           250
pYUB12        -             Water            500
pYUB12        +             Water            10^     10^
None          +             Water            8       6       35
pY6015(I)     -             Buffer                   A
pY6015(I)     +             Buffer                   22
pY6015(I)     -             Water                    39
pY6015(I)     +             Water                    98      500

(a) The intact autonomously replicating plasmid pYUB12 was used as
a control and the linear insert DNA of plasmid pY6015 [pY6015(I)]
was used as integrating DNA.

(b) Glycine was added to 1.5% to BCG cultures 24 hours prior to
transformation.

(c) The buffer is 1 mM MgCl2 (pH 7.2), 10% sucrose, 15% glycerol.
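
The Experiment 1 values for pYUB12 that remain legible in the Table (50, 250 and 500 transformants per μg) can be expressed as fold changes over the lowest-efficiency condition. This is a minimal sketch: the function name `fold_over_baseline` is hypothetical, the dictionary keys are descriptive labels rather than column headings, and the pairing of each value with a glycine treatment is an inference from the text rather than something the Table states unambiguously.

```python
def fold_over_baseline(counts: dict[str, float], baseline: str) -> dict[str, float]:
    """Express each transformation efficiency as a multiple of the baseline."""
    base = counts[baseline]
    if base <= 0:
        raise ValueError("baseline efficiency must be positive")
    return {condition: value / base for condition, value in counts.items()}


# Transformants per ug DNA for pYUB12, Experiment 1, in the order listed.
EXPT1_PYUB12 = {
    "buffer, first row": 50,
    "buffer, second row": 250,
    "water": 500,
}

if __name__ == "__main__":
    for condition, fold in fold_over_baseline(EXPT1_PYUB12, "buffer, first row").items():
        print(f"{condition}: {fold:g}x")
```

On these values the second buffer condition is 5-fold and the water condition 10-fold above the first buffer condition. The fourth pYUB12 entry is omitted because its exponent is not legible in the source.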
Experiments with linearized DNA molecules in yeast
indicate that the ends of linear DNA molecules are
recombinogenic; these ends may facilitate homologous
integration by invading genomic DNA at homologous sites to
initiate recombination (Rothstein, R., Methods Enzymol.
194:281-301 (1988)). The sequenced 4.4 kb BCG DNA fragment
containing uraA was used to investigate whether cloned DNA
sequences could integrate at the homologous locus in
M. bovis BCG. To mark the DNA fragment, the OMP
decarboxylase coding sequence was replaced with a
kanamycin-resistance gene (aph) to create pY6015
(Figure 3). This left intact approximately 1.5 kb of uraA
flanking sequences that could be used to direct homologous
integration. The transformation experiment described above
for plasmid pYUB12 was repeated with pY6015 insert DNA, and
the results are summarized in the Table under Experiment 2.
Again, transformation efficiencies were improved
substantially by exposing cultures to 1.5% glycine for 24
hours prior to harvest, and by performing the
electroporation in water rather than in buffer. However,
because the transformation efficiencies obtained with the
linear DNA were low, we made one additional attempt to
improve these efficiencies.

Cultures of M. bovis BCG and other slow growing
mycobacteria contain large numbers of cells that are
inviable or that have an exceedingly long lag time after
plating. Some investigators have suggested that
mycobacterial cells have an unusual ability to enter and
maintain a dormant state, even when nutrients are available
(Young and Cole, "Leprosy, Tuberculosis, and the New
Genetics", J. Bacteriol. 175:1-6, 1993). We reasoned that
maintenance of BCG cultures in mid-log growth might
maximize the fraction of cells that were undergoing DNA
synthesis and were competent to take up DNA and to
incorporate it into homologous sites in the genome. A third
experiment was performed, in which BCG cultures were
diluted approximately 1:4 every two days over a two-month
period to ensure persistent log-phase growth before
transformation. The results in the Table indicate that this
approach produces a significant increase in the number of
transformants obtained with either the autonomously
replicating vector or the linear DNA fragment.
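
The passaging schedule described above (approximately 1:4 every two days for two months) has a simple quantitative reading. The sketch below assumes, as an idealization not stated in the text, that the culture returns to the same density before each dilution and that "two months" is about 60 days; the function names are hypothetical.

```python
import math


def implied_doubling_time_h(dilution_factor: float, interval_h: float) -> float:
    """Doubling time needed to regain the starting density between dilutions."""
    if dilution_factor <= 1 or interval_h <= 0:
        raise ValueError("need dilution_factor > 1 and interval_h > 0")
    return interval_h / math.log2(dilution_factor)


def passage_count(total_days: int, interval_days: int) -> int:
    """Number of dilutions performed over the maintenance period."""
    if interval_days <= 0:
        raise ValueError("interval must be positive")
    return total_days // interval_days


if __name__ == "__main__":
    print(f"implied doubling time: {implied_doubling_time_h(4, 48):.0f} h")
    n = passage_count(60, 2)
    print(f"passages: {n}; cumulative dilution: 4^{n} = {4 ** n:.3e}")
```

Under these assumptions a 1:4 dilution every 48 hours corresponds to two doublings per interval, that is, a doubling time of roughly 24 hours sustained over about 30 passages.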

Ten of the BCG colonies obtained in the third
experiment were selected for further study after growing to
adequate size for picking (24 days after plating). The ten
transformants were colony purified, and DNA was prepared
from each. DNA preparations from the wild type strain and
the ten transformants were digested with a variety of
restriction endonucleases and Southern analysis revealed
that the kanamycin-resistant BCG transformants all
contained vector DNA integrated into the genome. In two of
the ten transformants, the transforming DNA had integrated
at the homologous locus. Figure 5 shows representative
results from Southern analysis of the wild type strain and
one of the BCG recombinants in which the cloned DNA
integrated at the homologous locus.
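
Two homologous integrants among ten transformants is a small sample, so the observed 20% frequency carries wide uncertainty. The sketch below computes a Wilson score interval for a binomial proportion; this is a standard statistical method chosen here for illustration, not an analysis performed in the source, and the function name `wilson_interval` is hypothetical.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.96 for ~95%)."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


if __name__ == "__main__":
    low, high = wilson_interval(2, 10)
    print(f"observed 2/10 = {2 / 10:.0%}; ~95% interval {low:.1%} to {high:.1%}")
```

The interval is broad, which is consistent with reading the result as a demonstration that homologous integration occurs rather than as a precise estimate of its frequency.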

Equivalents

Those skilled in the art will recognize, or be able
to ascertain using not more than routine experimentation,
many equivalents to the specific embodiments of the
invention described herein. Such equivalents are intended
to be encompassed by the following claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANT:
        (A) NAME: Whitehead Institute for Biomedical Research
        (B) STREET: Nine Cambridge Center
        (C) CITY: Cambridge
        (D) STATE/PROVINCE: Massachusetts
        (E) COUNTRY: USA
        (F) POSTAL CODE/ZIP: 02142
        (G) TELEPHONE:
        (I) TELEFAX:

    (ii) TITLE OF INVENTION: Homologously Recombinant Slow Growing
         Mycobacteria and Uses Therefor

    (iii) NUMBER OF SEQUENCES: 2

    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Hamilton, Brook, Smith & Reynolds, P.C.
        (B) STREET: Two Militia Drive
        (C) CITY: Lexington
        (D) STATE: MA
        (E) COUNTRY: USA
        (F) ZIP: 02173

    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER:
        (B) FILING DATE:
        (C) CLASSIFICATION:

    (vii) PRIOR APPLICATION DATA:
        (A) APPLICATION NUMBER: 08/095,734
        (B) FILING DATE: 22-JUL-1993

    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: Granahan, Patricia
        (B) REGISTRATION NUMBER: 32,227
        (C) REFERENCE/DOCKET NUMBER: WHI93-11MA PCT

    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: 617-861-6240
        (B) TELEFAX: 617-861-9540

(2) INFORMATION FOR SEQ ID NO:1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 4394 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: double
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



GAGCTCGACC CCGCCGCCGA AACAGAGGTG GCCCCGCAGA CCGAAAGGCC CAAGGTGCTG      60
ATCCTCGGTT CGGGGCCCAA TCGGATCGGC CAGGGTATCG AGTTCGACTA CAGCTGCGTA     120
CACGCGGCAA CCACGTTGAG CCAGGCTGGC TTTGAGACCG TGATGGTCAA CTGCAACCCG     180
GAGACCATGG TGTCCACCGA CTTCGACACC GCGGACAGGT TGTACTTCGA GCCGTTGACG     240
TTCGAGGACG TCTTGGAGGT CTACCACGCC GAAATGGAAT CCGGTAGCGG TGGCCCGGGA     300
GTGGCCGGCG TCATCGTGCA GCTCGGCGGC CAGACCCCGC TCGGCTGGCG CACCGGCTCG     360
CCGACGCCGG GTCCCGCTCG TGGGCACCCA CCGGAGGCCA TCGACCTGGC CGAGGATGCG     420
GCCGTTCGGC GACCTGCTGA GCGAGGACTG CCGGCGCCAA AGTACGGCAC CGCAACCACT     480
TTCGCCCAGG CCCGCCGGAT CGCCGAGGAG ATCGGCTATC CGGTGCTGGT GCGGCCGTCG     540
TATGTGCTCG GTGGTCGCGG CATGGAGATC GTGTATGACG AAGAAACGTT GCAGGGCTAC     600
ATCACCCGCG CCACTCAGCT ATCCCCCGAA CACCCGGTGC TCGTGCACCG CTTCCTCGAG     660
GACGCGGTCG AGATCGACGT CGACGCTCTG TGTGATGGCG CCGAGGTCTA TATCGGCGGA     720
ATCATGGAGC ACATCGAGGA GGCCGGCATC CACTCCGGTG ACTCGGCCTG TGCGCTGCCA     780
CCGGTCACGT TGGGCCGCAG CGACATCGAG AAGGTGCGTA AGGCCACTGA AGCCATTGCG     840
GATGGCATCG GCGTGGTGGG GCTGCTCAAC GTGCAGTCCG CGCTCAAGGA TGACGTGCTC     900
TACGTCCTGG AAGCCAACCC GAGAGCGAGC CGTACCGTTC CGTTTGTATC CAAGGCCACA     960
GCGGTGCCAC TCGCCAAGGC ATGCGCCCGG ATCATGTTGG GCGCCACCAT TGCCCAGCTG    1020
CGCGCCGAAG GCTTGCTGGC GGTCACCGGG GATGGCGCCC ACGCGGCGCG AAACGCCCCC    1080
ATCGCGGTCA ACCAGGCCGT GTTGCCGTTT CACCGGTTCC GGCGCGCCGA CGGGGCCGCC    1140
ATCGACTCGC TACTCGGCCC GGAGATGAAA TCGACCGGCG AGGTGATGGG CATCGACCGC    1200
GACTTCGGCA GCCGGTTCGC CAAGAGCCAG ACCGCCGCCT ACGGGTCGCT GCCGGCCCAG    1260
GGCACAGTGT TCGTGTCGGT GGCCAACCGG GACAAGCGGT CGCTGGTGTT TCCGGTCAAA    1320
CCGATTGGCC CACCTGGGTT TTCGCGTCCT TGCCACCGAA GCACCGCAGA GATCTTGCGC    1380
CGCAACGGTA TTCCCTGCGA CGACGTCCGC AAACATTTCG AGCCGGCGCA GCCCGGCCGC    1440
CCCACAATGT CGGCGGTGGA CGCGATCCGA GCCGGCGAGG TCAACATGGT GATCAACACT    1500
CCCTATGGCA ACTCCGGTCC GCGCATCGAC GGCTATGAGA TCCGTTCGGC GGCGGTGGCC    1560
GGCAACATCC CGTGCATCAC CACGGTGCAG GGCGCATCCG CCGCCGTGCA GGGGATAGAG    1620
GCCGGGATCC GCGGCGACAT CGGGGTGCGC TCCCTGCAGG AGCTGCACCG GGTGATCGGG    1680


GGCGTCGAGC GGTGACCGGG TTCGGTCTCC GGTTGGCCGA GGCAAAGGCA CGCCGCGGCC    1740
CGTTGTGTCT GGGCATCGAT CCGCATCCCG AGCTGCTGCG GGGCTGGGAT CTGGCGACCA    1800
CGGCCGACGG GCTGGCCGCG TTCTGCGACA TCTGCGTACG GGCCTTCGCT GATTTCGCGG    1860
TGGTCAAACC GCAGGTGGCG TTTTTTGAGT CATACGGGGC TGCCGGATTC GCGGTGCTGG    1920
AGCGCACCAT CGCGGAACTG CGGGCCGCAG ACGTGCTGGT GTTGGCCGAC GCCAAGCGCG    1980
GCGACATTGG GGCGACCATG TCGGCGTATG CGACGGCCTG GGTGGGCGAC TCGCCGCTGG    2040
CGGCCGACGG CGTGACGGCC TCGCCCTATT TGGGCTTCGG TTCGCTGCGG CCGCTGCTAG    2100
AGGTCGCGGC CGCCCACGGC CGAGGGGTGT TCGTGCTGGC GGCCACCTCC AATCCCGAGG    2160
GTGCGGCGGT GCAGAATGCC GCCGCCGACG GCCGCAGCGT GGCCCAGTTG GTCGTGGACC    2220
AGGTGGGGGC GGCCAACGAG GCGGCAGGAC CCGGGCCCGG ATCCATCGGC GTGGTCGTCG    2280
GCGCAACGGC GCCACAGGCC CCCGATCTCA GCGCCTTCAC CGGGCCGGTG CTGGTGCCCG    2340
GCGTGGGGGT GCAGGGCGGG CGCCCGGAGG CGCTGGGCGG TCTGGGCGGG GCCGCATCGA    2400
GCCAGCTGTT GCCCGCGGTG GCGCGCGAGG TCTTGCGGGC CGGCCCCGGC GTGCCCGAAT    2460
TGCGCGCCGC GGGCGAACGG ATGCGCGATG CCGTCGCCTA TCTCGCTGCC GTGTAGCGGG    2520
TGCCCTGCCA CCGCGCCGCT AAATCCCACC AGCATGGGGT GGTGAGCCCA GCGCTCGTGT    2580
GACCAAACTC ACCGCCCTGG GCCGTCGTCA CGCTGTGTTA ACCTCTCGTT CAAATGATAT    2640
TCATATTCAA TAGTGGCGCT AAGTGTCCGG TTGAATCCCC GTTGAACCCC CAACAGATGG    2700
AGTCTGTGTC GTGACGTTGC GAGTCGTTCC CGAAAGCCTG GCAGGCGCCA GCGCTGCCAT    2760
CGAAGCAGTG ACCGCTCGCC TGGCCGCCGC GCACGCCGCG GCGGCCCCGT TTATCGCGGC    2820
GGTCATCCCG CCTGGGTCCG ACTCGGTTTC GGTGTGCAAC GCCGTTGAGT TCAGCGTTCA    2880
CGGTAGTCAG CATGTGGCAA TGGCCGCTCA GGGGGTTGAG GAGCTCGGCC GCTCGGGGGT    2940
CGGGGTGGCC GAATCGGGTG CCAGTTATGC CGCTAGGATG CGCTGGCGGC GGCGTCGTAT    3000
CTCAGCGGTG GGCTATGACC GAGCCGTGGA TAGCCTTCCC TCCCGAGGTG CACTCGGCGA    3060
TGCTGAACTA CGGTGCGGGC GTTGGGCCGA TGTTGATCTC CGCCACGCAG AATGGGGAGC    3120
TCAGCGCCCA ATACGCAGAA GCGGCATCCG AGGTCGAGGA ATTGTTGGGG GTGGTGGCCT    3180
CCGAGGGATG GCAGGGGCAA GCCGCCGAGG CGTTAGTCGC CGCGTACATG CCGTTTCTGG    3240
CGTGGCTGAT CCAAGCCAGC GCCGACTGCG TGGAAATGGC CGCCCAGCAA CACGCCGTCA    3300
TCGAGGCCTA CACTGCCGCG GTAGAGCTGA TGCCTACTCA GGTCGAACTG GCCGCCAACC    3360
AAATCAAGCT CGCGGTGTTG GTAGCGACCA ATTTCTTTGG CATCAACACC ATTCCCATTG    3420
CGATCAATGA GGCCGAGTAC GTGGAGATGT GGGTTCGGGC CGCCACCACG ATGGCGACCT    3480


ATTCAACAGT 


CTCCAGATCG 


GCGCTCTCCG 


CGATGCCGCA 


CACCAGCCCC 


CCGCCGCTGA 


3540 


TCCTGAAATC 


CGATGAACTG 


CTCCCCGACA 


CCGGGGAGGA 


CTCCGATGAA 


GACGGCCACA 


3600 


ACCATGGCGG 


TCACAGTCAT 


GGCGGTCACG 


CCAGGATGAT 


CGATAACTTC 


TTTGCCGAAA 


3660 


TCCTGCGTGG 


CGTCAGCGCG 


GGCCGCATTG 


TTTGGGACCC 


CGTCAACGGC 


ACCCTCAACG 


3720 


GACTCGACTA 


CGACGATTAC 


GTCTACCCCG 


GTCACGCGAT 


CTGGTGGCTG 


GCTCGAGGCC 


3780 


TCGAGTTTTT 


TCAGGATGGT 


GAACAATTTG 


GCGAACTGTT 


GTTCACCAAT 


CCGACTGGGG 


3840 


CTTTTCAGTT 


CCTCCTCTAC 


GTCGTTGTGG 


TGGATTTGCC 


GACGCACATA 


GCCCAGATCG 


3900 


CTACCTGGCT 


GGGCCAGTAC 


CCGCAGTTGC 


TGTCGGCTGC 


CCTCACTGGC 


GTCATCGCCC 


3960 


ACCTGGGAGC 


AATAACTGGT 


TTGGCGGGCC 


TATCCGGCCT 


GAGCGCCATT 


CCGTCTGCTG 


4020 


CGATACCCGC 


CGTTGTACCG . 


GAGCTGACAC 


CCGTCGCGGC 


CGCGCCGCCT 


ATGTTGGCGG 


4080 


TCGCCGGGGT 


GGGCCCTGCA 


GTCGCCGCGC 


CGGGCATGCT 


CCCCGCCTCA 


GCACCCGCAC 


4140 


CGGCGGCAGC 


GGCCGGCGCC 


ACCGCAGCCG 


GCCCGACGCC 


GCCGGCGACT 


GGTTTCGGAG 


4200 


GGCTTCCCGC 


CCTACCTGGT 


CGGCGGTGGC 


GGCCCAGGAA 


TAGGGTTCGG 


CTCGGGACAG 


4260 


TCGGCCCACG 


CCAAGGCCGC 


GGCGTCCGAT 


TCCGCTGCAG 


CCGAGTCGGC 


GGCCCAGGCC 


4320 


TCGGCGCGTG 


CGCAGGCGCG 


TGCTGCACGG 


CGGGGCCGCT 


CGGCGGCAAG 


GCACGTGGCC 


4380 


ATCGTGACGA 


ATTC 










4394 
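
The listing above prints the sequence in blocks of ten nucleotides, with a running position count after every sixty bases. As a minimal sketch (not part of the original disclosure), the Python below joins such blocks and checks each printed count against the number of bases actually read. The names `parse_listing` and `first_position` are illustrative assumptions.

```python
import re

def parse_listing(text, first_position=1):
    """Join nucleotide blocks and verify the running position counts.

    `first_position` is the 1-based position of the first base in `text`
    (supplied by the caller when starting part-way through a listing).
    Returns the joined sequence and a list of (printed, counted) mismatches.
    """
    blocks = []
    mismatches = []
    position = first_position - 1  # position of the last base read so far
    for token in text.split():
        if re.fullmatch(r"[ACGT]+", token):
            blocks.append(token)
            position += len(token)
        elif token.isdigit():
            printed = int(token)
            if printed != position:
                mismatches.append((printed, position))
        # any other token (stray punctuation, labels) is ignored
    return "".join(blocks), mismatches

# Final two rows of SEQ ID NO: 1 as printed in the listing.
tail = """
TCGGCGCGTG CGCAGGCGCG TGCTGCACGG CGGGGCCGCT CGGCGGCAAG GCACGTGGCC 4380
ATCGTGACGA ATTC 4394
"""
sequence, mismatches = parse_listing(tail, first_position=4321)
print(len(sequence), mismatches)
```

An empty mismatch list means every printed count agrees with the bases counted, which is a quick way to spot dropped or duplicated blocks in a transcribed listing.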



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1271 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu Leu Asp Pro Ala Ala Glu Thr Glu Val Ala Pro Gln Thr Glu Arg
1 5 10 15

Pro Lys Val Leu Ile Leu Gly Ser Gly Pro Asn Arg Ile Gly Gln Gly
20 25 30

Ile Glu Phe Asp Tyr Ser Cys Val His Ala Ala Thr Thr Leu Ser Gln
35 40 45

Ala Gly Phe Glu Thr Val Met Val Asn Cys Asn Pro Glu Thr Met Val
50 55 60
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Ser Thr Asp Phe Asp Thr Ala Asp Arg Leu Tyr Phe Glu Pro Leu Thr
65 70 75 80

Phe Glu Asp Val Leu Glu Val Tyr His Ala Glu Met Glu Ser Gly Ser
85 90 95

Gly Gly Pro Gly Val Ala Gly Val Ile Val Gln Leu Gly Gly Gln Thr
100 105 110

Pro Leu Gly Trp Arg Thr Gly Ser Pro Thr Pro Gly Pro Ala Arg Gly
115 120 125

His Pro Pro Glu Ala Ile Asp Leu Ala Glu Asp Ala Ala Val Arg Arg
130 135 140

Pro Ala Glu Arg Gly Leu Pro Ala Pro Lys Tyr Gly Thr Ala Thr Thr
145 150 155 160

Phe Ala Gln Ala Arg Arg Ile Ala Glu Glu Ile Gly Tyr Pro Val Leu
165 170 175

Val Arg Pro Ser Tyr Val Leu Gly Gly Arg Gly Met Glu Ile Val Tyr
180 185 190

Asp Glu Glu Thr Leu Gln Gly Tyr Ile Thr Arg Ala Thr Gln Leu Ser
195 200 205

Pro Glu His Pro Val Leu Val His Arg Phe Leu Glu Asp Ala Val Glu
210 215 220

Ile Asp Val Asp Ala Leu Cys Asp Gly Ala Glu Val Tyr Ile Gly Gly
225 230 235 240

Ile Met Glu His Ile Glu Glu Ala Gly Ile His Ser Gly Asp Ser Ala
245 250 255

Cys Ala Leu Pro Pro Val Thr Leu Gly Arg Ser Asp Ile Glu Lys Val
260 265 270

Arg Lys Ala Thr Glu Ala Ile Ala His Gly Ile Gly Val Val Gly Leu
275 280 285

Leu Asn Val Gln Ser Ala Leu Lys Asp Asp Val Leu Tyr Val Leu Glu
290 295 300

Ala Asn Pro Arg Ala Ser Arg Thr Val Pro Phe Val Ser Lys Ala Thr
305 310 315 320

Ala Val Pro Leu Ala Lys Ala Cys Ala Arg Ile Met Leu Gly Ala Thr
325 330 335

Ile Ala Gln Leu Arg Ala Glu Gly Leu Leu Ala Val Thr Gly Asp Gly
340 345 350

Ala His Ala Ala Arg Asn Ala Pro Ile Ala Val Asn Gln Ala Val Leu
355 360 365

Pro Phe His Arg Phe Arg Arg Ala Asp Gly Ala Ala Ile Asp Ser Leu
370 375 380
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Leu Gly Pro Glu Met Lys Ser Thr Gly Glu Val Met Gly Ile Asp Arg
385 390 395 400

Asp Phe Gly Ser Arg Phe Ala Lys Ser Gln Thr Ala Ala Tyr Gly Ser
405 410 415

Leu Pro Ala Gln Gly Thr Val Phe Val Ser Val Ala Asn Arg Asp Lys
420 425 430

Arg Ser Leu Val Phe Pro Val Lys Arg Leu Ala His Leu Gly Phe Arg
435 440 445

Val Leu Ala Thr Glu Ala Pro Gln Arg Ser Cys Ala Ala Thr Val Phe
450 455 460

Pro Ala Thr Thr Ser Ala Asn Ile Ser Ser Arg Arg Ser Pro Ala Ala
465 470 475 480

Pro Gln Cys Arg Arg Trp Thr Arg Ser Glu Pro Ala Arg Ser Thr Trp
485 490 495

Met Thr Gly Phe Gly Leu Arg Leu Ala Glu Ala Lys Ala Arg Arg Gly
500 505 510

Pro Leu Cys Leu Gly Ile Asp Pro His Pro Glu Leu Leu Arg Gly Trp
515 520 525

Asp Leu Ala Thr Thr Ala Asp Gly Leu Ala Ala Phe Cys Asp Ile Cys
530 535 540

Val Arg Ala Phe Ala Asp Phe Ala Val Val Lys Pro Gln Val Ala Phe
545 550 555 560

Phe Glu Ser Tyr Gly Ala Ala Gly Phe Ala Val Leu Glu Arg Thr Ile
565 570 575

Ala Glu Leu Arg Ala Ala Asp Val Leu Val Leu Ala Asp Ala Lys Arg
580 585 590

Gly Asp Ile Gly Ala Thr Met Ser Ala Tyr Ala Thr Ala Trp Val Gly
595 600 605

Asp Ser Pro Leu Ala Ala Asp Ala Val Thr Ala Ser Pro Tyr Leu Gly
610 615 620

Phe Gly Ser Leu Arg Pro Leu Leu Glu Val Ala Ala Ala His Gly Arg
625 630 635 640

Gly Val Phe Val Leu Ala Ala Thr Ser Asn Pro Glu Gly Ala Ala Val
645 650 655

Gln Asn Ala Ala Ala Asp Gly Arg Ser Val Ala Gln Leu Val Val Asp
660 665 670

Gln Val Gly Ala Ala Asn Glu Ala Ala Gly Pro Gly Pro Gly Ser Ile
675 680 685
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Gly Val Val Val Gly Ala Thr Ala Pro Gln Ala Pro Asp Leu Ser Ala
690 695 700

Phe Thr Gly Pro Val Leu Val Pro Gly Val Gly Val Gln Gly Gly Arg
705 710 715 720

Pro Glu Ala Leu Gly Gly Leu Gly Gly Ala Ala Ser Ser Gln Leu Leu
725 730 735

Pro Ala Val Ala Arg Glu Val Leu Arg Ala Gly Pro Gly Val Pro Glu
740 745 750

Leu Arg Ala Ala Gly Glu Arg Met Arg Asp Ala Val Ala Tyr Leu Ala
755 760 765

Ala Val Met Trp Gln Trp Pro Leu Arg Gly Leu Arg Ser Ser Ala Ala
770 775 780

Arg Gly Ser Gly Trp Pro Asn Arg Val Pro Val Met Pro Leu Gly Cys
785 790 795 800

Ala Gly Gly Gly Val Val Ser Gln Arg Trp Ala Met Thr Glu Pro Trp
805 810 815

Ile Ala Phe Pro Pro Glu Val His Ser Ala Met Leu Asn Tyr Gly Ala
820 825 830

Gly Val Gly Pro Met Leu Ile Ser Ala Thr Gln Asn Gly Glu Leu Ser
835 840 845

Ala Gln Tyr Ala Glu Ala Ala Ser Glu Val Glu Glu Leu Leu Gly Val
850 855 860

Val Ala Ser Glu Gly Trp Gln Gly Gln Ala Ala Glu Ala Leu Val Ala
865 870 875 880

Ala Tyr Met Pro Phe Leu Ala Trp Leu Ile Gln Ala Ser Ala Asp Cys
885 890 895

Val Glu Met Ala Ala Gln Gln His Ala Val Ile Glu Ala Tyr Thr Ala
900 905 910

Ala Val Glu Leu Met Pro Thr Gln Val Glu Leu Ala Ala Asn Gln Ile
915 920 925

Lys Leu Ala Val Leu Val Ala Thr Asn Phe Phe Gly Ile Asn Thr Ile
930 935 940

Pro Ile Ala Ile Asn Glu Ala Glu Tyr Val Glu Met Trp Val Arg Ala
945 950 955 960

Ala Thr Thr Met Ala Thr Tyr Ser Thr Val Ser Arg Ser Ala Leu Ser
965 970 975

Ala Met Pro His Thr Ser Pro Pro Pro Leu Ile Leu Lys Ser Asp Glu
980 985 990

Leu Leu Pro Asp Thr Gly Glu Asp Ser Asp Glu Asp Gly His Asn His
995 1000 1005
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Gly Gly His Ser His Gly Gly His Ala Arg Met Ile Asp Asn Phe Phe
1010 1015 1020

Ala Glu Ile Leu Arg Gly Val Ser Ala Gly Arg Ile Val Trp Asp Pro
1025 1030 1035 1040

Val Asn Gly Thr Leu Asn Gly Leu Asp Tyr Asp Asp Tyr Val Tyr Pro
1045 1050 1055

Gly His Ala Ile Trp Trp Leu Ala Arg Gly Leu Glu Phe Phe Gln Asp
1060 1065 1070

Gly Glu Gln Phe Gly Glu Leu Leu Phe Thr Asn Pro Thr Gly Ala Phe
1075 1080 1085

Gln Phe Leu Leu Tyr Val Val Val Val Asp Leu Pro Thr His Ile Ala
1090 1095 1100

Gln Ile Ala Thr Trp Leu Gly Gln Tyr Pro Gln Leu Leu Ser Ala Ala
1105 1110 1115 1120

Leu Thr Gly Val Ile Ala His Leu Gly Ala Ile Thr Gly Leu Ala Gly
1125 1130 1135

Leu Ser Gly Leu Ser Ala Ile Pro Ser Ala Ala Ile Pro Ala Val Val
1140 1145 1150

Pro Glu Leu Thr Pro Val Ala Ala Ala Pro Pro Met Leu Ala Val Ala
1155 1160 1165

Gly Val Gly Pro Ala Val Ala Ala Pro Gly Met Leu Pro Ala Ser Ala
1170 1175 1180

Pro Ala Pro Ala Ala Ala Ala Gly Ala Thr Ala Ala Gly Pro Thr Pro
1185 1190 1195 1200

Pro Ala Thr Gly Phe Gly Gly Leu Pro Ala Leu Pro Gly Arg Arg Trp
1205 1210 1215

Arg Pro Arg Asn Arg Val Arg Leu Gly Thr Val Gly Pro Arg Gln Gly
1220 1225 1230

Arg Gly Val Arg Phe Arg Cys Ser Arg Val Gly Gly Pro Gly Leu Gly
1235 1240 1245

Ala Cys Ala Gly Ala Cys Cys Thr Ala Gly Pro Leu Gly Gly Lys Ala
1250 1255 1260

Arg Gly His Arg Asp Glu Phe
1265 1270 
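
SEQ ID NO: 2 is written in three-letter residue codes, whereas Figure 2 uses one-letter codes. A short sketch of the conversion, assuming the standard abbreviations (for example `Gln` and `Ile`); the helper name `to_one_letter` is hypothetical and not part of the original disclosure.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

def to_one_letter(residues):
    """Convert whitespace-separated three-letter codes to a one-letter string.

    Raises KeyError on an unrecognised code, which helps flag transcription
    errors in a listing.
    """
    return "".join(THREE_TO_ONE[code] for code in residues.split())

first_row = "Glu Leu Asp Pro Ala Ala Glu Thr Glu Val Ala Pro Gln Thr Glu Arg"
last_row = "Arg Gly His Arg Asp Glu Phe"
print(to_one_letter(first_row))
print(to_one_letter(last_row))
```

Comparing the converted rows against the one-letter lines in Figure 2 is a simple consistency check between the listing and the figure.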
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CLAIMS 

1. A method of transforming a slow-growing mycobacterium
with heterologous DNA, comprising the steps of:

a) combining a slow-growing mycobacterium and
heterologous DNA to be transformed into the
slow-growing mycobacterium, thereby producing a
combination; and

b) subjecting the combination produced in (a) to
electroporation in water, under conditions
sufficient for introduction of the heterologous DNA
into the slow-growing mycobacterium,

thereby producing a slow-growing mycobacterium
transformed with the heterologous DNA.

2. The method of Claim 1 wherein the slow-growing
mycobacterium of (a) has been exposed to glycine prior
to being combined with the heterologous DNA.

3. The method of Claim 2 wherein the slow-growing
mycobacterium is exposed to approximately 1.5% glycine
present in culture medium in which the slow-growing
mycobacterium is growing.

4. The method of Claim 1 in which the slow-growing
mycobacterium is continuously propagated in mid-log
phase.
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5. The method of Claim 1 wherein the slow-growing
mycobacterium is selected from the group consisting of:
Mycobacterium bovis BCG, Mycobacterium tuberculosis,
Mycobacterium leprae, Mycobacterium avium,
Mycobacterium africanum and Mycobacterium intracellulare.

6. The method of Claim 5 wherein the heterologous DNA 
comprises DNA homologous to genomic DNA of the slow- 
growing mycobacterium combined in step (a) with the 
heterologous DNA. 

7. The method of Claim 6 wherein the heterologous DNA is
introduced into the uraA locus of the slow-growing
mycobacterium.

8. The method of Claim 6 wherein the heterologous DNA
additionally comprises DNA which is not homologous to
genomic DNA of the slow-growing mycobacterium
combined in step (a) with the heterologous DNA.

9. The method of Claim 8 wherein the heterologous DNA is
introduced into the uraA locus of the slow-growing
mycobacterium.

10. The method of Claim 6 wherein the slow-growing
mycobacterium is Mycobacterium tuberculosis and the DNA
homologous to genomic DNA of the slow-growing
mycobacterium is introduced into genomic DNA in a
location which results in attenuation of the
Mycobacterium tuberculosis.
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11. The method of Claim 6 wherein the slow-growing
mycobacterium is M. tuberculosis and the heterologous DNA
comprises DNA homologous to genomic DNA of M.
tuberculosis selected from the group consisting of: the
katG gene, the Hsp60 gene, aroA, lysA, uraA and
M. tuberculosis DNA associated with entry into and
survival inside cells.

12. A method of producing a homologously recombinant
slow-growing mycobacterium having heterologous DNA
incorporated into genomic DNA at a homologous locus,
comprising the steps of:

a) combining a slow-growing mycobacterium and
heterologous DNA to be transformed into the
slow-growing mycobacterium, the heterologous DNA
comprising DNA homologous to genomic DNA of the
slow-growing mycobacterium, thereby producing a
combination; and

b) subjecting the combination produced in (a) to
electroporation in water, under conditions
sufficient for introduction of the heterologous DNA
into the slow-growing mycobacterium and integration
into genomic DNA of the slow-growing mycobacterium
at a homologous locus,

thereby producing a homologously recombinant
slow-growing mycobacterium having heterologous DNA
incorporated into its genomic DNA at a homologous locus.
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13. The method of Claim 12 wherein the slow-growing
mycobacterium of (a) has been exposed to glycine prior
to being combined with the heterologous DNA.

14. The method of Claim 13 wherein the slow-growing
mycobacterium is exposed to approximately 1.5% glycine
present in culture medium in which the slow-growing
mycobacterium is growing.

15. The method of Claim 12 in which the slow-growing
mycobacterium is continuously propagated in mid-log
phase.

16. The method of Claim 12 wherein the slow-growing
mycobacterium is selected from the group consisting of:
Mycobacterium bovis BCG, Mycobacterium tuberculosis,
Mycobacterium leprae, Mycobacterium avium,
Mycobacterium africanum and Mycobacterium intracellulare.

17. The method of Claim 12 wherein the heterologous DNA
additionally comprises DNA which is not homologous to
genomic DNA of the slow-growing mycobacterium
combined in step (a) with the heterologous DNA.

18. The method of Claim 17 wherein the slow-growing
mycobacterium is Mycobacterium bovis BCG and the DNA
homologous to genomic DNA of the slow-growing
mycobacterium is DNA contained in the Mycobacterium bovis
BCG orotidine-5'-monophosphate decarboxylase gene
locus or its flanking sequences.
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19. A method of producing attenuated Mycobacterium
tuberculosis, comprising inactivating or deleting in
Mycobacterium tuberculosis DNA selected from the
group consisting of: the katG gene, the Hsp60 gene,
aroA, lysA, uraA and DNA associated with entry into
and survival inside cells of Mycobacterium
tuberculosis.

20. A method of producing attenuated Mycobacterium
tuberculosis comprising introducing into M. tuberculosis
heterologous DNA by:

a) combining Mycobacterium tuberculosis and
heterologous DNA, thereby producing a combination;
and

b) subjecting the combination produced in a) to
electroporation in water, under conditions
sufficient for introduction of the heterologous DNA
into the Mycobacterium tuberculosis,

wherein the heterologous DNA is homologous to
Mycobacterium tuberculosis genomic DNA selected from the
group consisting of: the katG gene, the Hsp60 gene,
aroA, lysA, uraA and M. tuberculosis DNA associated
with entry into and survival inside cells and is
introduced into genomic DNA of the Mycobacterium
tuberculosis, thereby deleting or disrupting the
corresponding Mycobacterium tuberculosis genomic DNA.



21. The method of Claim 20 wherein the Mycobacterium
tuberculosis has been exposed to glycine prior to
being combined with the heterologous DNA.
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22. The method of Claim 21 wherein the Mycobacterium
tuberculosis is exposed to approximately 1.5% glycine
present in culture medium in which the Mycobacterium
tuberculosis is growing.

23. The method of Claim 20 in which the Mycobacterium
tuberculosis is continuously propagated in mid-log
phase.

24. The method of Claim 20 wherein the heterologous DNA
additionally comprises DNA which is not homologous to
Mycobacterium tuberculosis genomic DNA.

25. A DNA construct consisting essentially of:

a) DNA homologous to genomic DNA of a slow-growing
mycobacterium which is a genetic marker; and

b) DNA nonhomologous to genomic DNA of the
slow-growing mycobacterium,

wherein the DNA nonhomologous to genomic DNA of the
slow-growing mycobacterium is flanked by the DNA
homologous to genomic DNA of a slow-growing
mycobacterium.

26. The DNA construct of Claim 25 wherein the
slow-growing mycobacterium is Mycobacterium bovis BCG and the
genetic marker is the orotidine-5'-monophosphate
decarboxylase gene locus.
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27. A homologously recombinant slow-growing mycobacterium
having incorporated therein heterologous DNA
homologous to genomic DNA of the slow-growing
mycobacterium.

28. The homologously recombinant slow-growing
mycobacterium of Claim 27, wherein the heterologous DNA is
incorporated into mycobacterial genomic DNA which is
a genetic marker.

29. The homologously recombinant slow-growing
mycobacterium of Claim 28, wherein the slow-growing
mycobacterium is Mycobacterium bovis BCG and the genetic
marker is the orotidine-5'-monophosphate
decarboxylase gene locus.

30. The homologously recombinant slow-growing
mycobacterium of Claim 29, wherein the heterologous DNA
additionally comprises DNA nonhomologous to genomic DNA
of the slow-growing mycobacterium.



31. The homologously recombinant slow-growing
mycobacterium of Claim 30, wherein the DNA nonhomologous to
genomic DNA of the slow-growing mycobacterium is DNA
encoding a protein or polypeptide selected from the
group consisting of: antigens, enzymes, cytokines,
lymphokines and immunopotentiators.
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32. The homologously recombinant slow-growing
mycobacterium of Claim 30, wherein the nonhomologous DNA
encodes an antigen of a pathogen.

33. The homologously recombinant slow-growing
mycobacterium of Claim 21, wherein the heterologous DNA is not
expressed and incorporation of the heterologous DNA
homologous to genomic DNA of the slow-growing
mycobacterium into the genomic DNA inactivates or
activates a gene in the genomic DNA.

34. The homologously recombinant slow-growing
mycobacterium of Claim 27 wherein the slow-growing
mycobacterium is M. tuberculosis and the gene in the genomic
DNA is selected from the group consisting of: the
katG gene, the Hsp60 gene, aroA, lysA, uraA and
M. tuberculosis DNA associated with entry into and
survival inside cells of M. tuberculosis.



35. Isolated DNA of mycobacterial origin encoding
orotidine-5'-monophosphate decarboxylase.

36. Isolated DNA of Claim 35 having the nucleotide
sequence of SEQ ID No. 1 from nucleotide 1691 through
and including nucleotide 2512.

37. Isolated orotidine-5'-monophosphate decarboxylase of
mycobacterial origin.
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38. Isolated orotidine-5'-monophosphate decarboxylase of
Claim 37 encoded by DNA having the nucleotide
sequence of SEQ ID No. 1 from nucleotide 1691 through
and including nucleotide 2512.
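
Claims 36 and 38 define the coding region by 1-based, inclusive nucleotide positions (1691 through and including 2512). The sketch below shows how such a range maps onto a zero-based string slice. `subsequence` is a hypothetical helper, and the placeholder string merely stands in for SEQ ID NO: 1 at its listed length of 4394 nucleotides.

```python
def subsequence(seq, first, last):
    """Return seq[first..last] using 1-based, inclusive positions."""
    if not 1 <= first <= last <= len(seq):
        raise ValueError("positions must satisfy 1 <= first <= last <= len(seq)")
    # Python slices are zero-based and exclude the end index.
    return seq[first - 1:last]

placeholder = "N" * 4394  # stand-in only; not the real sequence
region = subsequence(placeholder, 1691, 2512)
print(len(region), len(region) // 3, len(region) % 3)
```

The claimed range spans 822 nucleotides, a whole number of codons (274), which is what one would expect of an open reading frame.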
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GAGCTCGACCCCGCCGCCGAAACAGAGGTGGCCCCGCAGACCGAAAGGCCCAAGGTGCTG 
ELDPAAETEVAPQTERPKVL

ATCCTCGGTTCGGGGCCCAATCGGATCGGCCAGGGTATCGAGTTCGACTACAGCTGCGTA 
ILGSGPNRIGQGIEFDYSCV 

CACGCGGCAACCACGTTGAGCCAGGCTGGCTTTGAGACCGTGATGGTCAACTGCAACCCG 
HAATTLSQAGFETVMVNCNP 

GAGACCATGGTGTCCACCGACTTCGACACCGCGGACAGGTTGTACTTCGAGCCGTTGACG 
ETMVSTDFDTADRLYFEPLT

TTCGAGGACGTCTTGGAGGTCTACCACGCCGAAATGGAATCCGGTAGCGGTGGCCCGGGA 
FEDVLEVYHAEMESGSGGPG 

GTGGCCGGCGTCATCGTGCAGCTCGGCGGCCAGACCCCGCTCGGCTGGCGCACCGGCTCG 
VAGVIVQLGGQTPLGWRTGS

CCGACGCCGGGTCCCGCTCGTGGGCACCCACCGGAGGCCATCGACCTGGCCGAGGATGCG 
PTPGPARGHPPEAIDLAEDA 

GCCGTTCGGCGACCTGCTGAGCGAGGACTGCCGGCGCCAAAGTACGGCACCGCAACCACT 
AVRRPAERGLPAPKYGTATT 

TTCGCCCAGGCCCGCCGGATCGCCGAGGAGATCGGCTATCCGGTGCTGGTGCGGCCGTCG 
FAQARRIAEEIGYPVLVRPS 

TATGTGCTCGGTGGTCGCGGCATGGAGATCGTGTATGACGAAGAAACGTTGCAGGGCTAC 
YVLGGRGMEIVYDEETLQGY 

ATCACCCGCGCCACTCAGCTATCCCCCGAACACCCGGTGCTCGTGCACCGCTTCCTCGAG 
ITRATQLSPEHPVLVHRFLE 

GACGCGGTCGAGATCGACGTCGACGCTCTGTGTGATGGCGCCGAGGTCTATATCGGCGGA 
DAVEIDVDALCDGAEVYIGG 

ATCATGGAGCACATCGAGGAGGCCGGCATCCACTCCGGTGACTCGGCCTGTGCGCTGCCA 
IMEHIEEAGIHSGDSACALP

CCGGTCACGTTGGGCCGCAGCGACATCGAGAAGGTGCGTAAGGCCACTGAAGCCATTGCG 
PVTLGRSDIEKVRKATEAIA 

CATGGCATCGGCGTGGTGGGGCTGCTCAACGTGCAGTCCGCGCTCAAGGATGACGTGCTC 
HGIGVVGLLNVQSALKDDVL 

TACGTCCTGGAAGCCAACCCGAGAGCGAGCCGTACCGTTCCGTTTGTATCCAAGGCCACA 
YVLEANPRASRTVPFVSKAT 

GCGGTGCCACTCGCCAAGGCATGCGCCCGGATCATGTTGGGCGCCACCATTGCCCAGCTG 
AVPLAKACARIMLGATIAQL

CGCGCCGAAGGCTTGCTGGCGGTCACCGGGGATGGCGCCCACGCGGCGCGAAACGCCCCC 
RAEGLLAVTGDGAHAARNAP 

ATCGCGGTCAACCAGGCCGTGTTGCCGTTTCACCGGTTCCGGCGCGCCGACGGGGCCGCC 
IAVNQAVLPFHRFRRADGAA

ATCGACTCGCTACTCGGCCCGGAGATGAAATCGACCGGCGAGGTGATGGGCATCGACCGC 
IDSLLGPEMKSTGEVMGIDR

GACTTCGGCAGCCGGTTCGCCAAGAGCCAGACCGCCGCCTACGGGTCGCTGCCGGCCCAG 
DFGSRFAKSQTAAYGSLPAQ

FIGURE 2A
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GGCACAGTGTTCGTGTCGGTGGCCAACCGGGACAAGCGGTCGCTGGTGTTTCCGG^ 1320
GTVFVSVANRDKRSLVFPVK

CCGATTGGCCCACCTGGGTTTTCGCGTCCTTGCCACCGAAGCACCGCAGAGATC^^ 1380 
RLAHLGFRVLATEAPQRSCA

GCAACGGTATTCCCTGCGACGACGTCCGCAAACATTTCGAGCCGGCGCAGCCCGGCCGCC 1440
ATVFPATTSANISSRRSPAA

CCACAATGTCGGCGGTGGACGCGATCCGAGCCGGCGAGGTCAACATGGTGATCAACACTC 1500 
PQCRRWTRSEPARSTW 

CCTATGGCAACTCCGGTCCGCGCATCGACGGCTATGAGATCCGTTCGGCGGCGGTGGCCG 1560

GCAACATCCCGTGCATCACCACGGTGCAGGGCGCATCCGCCGCCGTGCAGGGGATAGAGG 1620 

CCGGGATCCGCGGCGACATCGGGGTGCGCTCCCTGCAGGAGCTGCACCGGGTGATCGGGG 1680 

GCGTCGAGCGGTGACCGGGTTCGGTCTCCGGTTGGCCGAGGCAAAGGCACGCCGCGGCCC 1740 
MTGFGLRLAEAKARRGP

GTTGTGTCTGGGCATCGATCCGCATCCCGAGCTGCTGCGGGGCTGGGATCTGGCGACCAC 1800 
LCLGIDPHPELLRGWDLATT

GGCCGACGGGCTGGCCGCGTTCTGCGACATCTGCGTACGGGCCTTCGCTGATTTCGCGGT 1860 
ADGLAAFCDICVRAFADFAV 

GGTCAAACCGCAGGTGGCGTTTTTTGAGTCATACGGGGCTGCCGGATTCGCGGTGCTGGA 1920
VKPQVAFFESYGAAGFAVLE

GCGCACCATCGCGGAACTGCGGGCCGCAGACGTGCTGGTGTTGGCCGACGCCAAGCGCGG 1980
RTIAELRAADVLVLADAKRG

CGACATTGGGGCGACCATGTCGGCGTATGCGACGGCCTGGGTGGGCGACTCGCCGCTGGC 2040
DIGATMSAYATAWVGDSPLA

CGCCGACGCCGTGACGGCCTCGCCCTATTTGGGCTTCGGTTCGCTGCGGCCGCTGCTAGA 2100 
ADAVTASPYLGFGSLRPLLE

GGTCGCGGCCGCCCACGGCCGAGGGGTGTTCGTGCTGGCGGCCACCTCCAATCCCGAGGG 2160 
VAAAHGRGVFVLAATSNPEG 

TGCGGCGGTGCAGAATGCCGCCGCCGACGGCCGCAGCGTGGCCCAGTTGGTCGTGGACCA 2220 
AAVQNAAADGRSVAQLVVDQ

GGTGGGGGCGGCCAACGAGGCGGCAGGACCCGGGCCCGGATCCATCGGCGTGGTCGTCGG 2280
VGAANEAAGPGPGSIGVVVG 

CGCAACGGCGCCACAGGCCCCCGATCTCAGCGCCTTCACCGGGCCGGTGCTGGTGCCCGG 2340
ATAPQAPDLSAFTGPVLVPG

CGTGGGGGTGCAGGGCGGGCGCCCGGAGGCGCTGGGCGGTCTGGGCGGGGCCGCATCGAG 2400 
VGVQGGRPEALGGLGGAASS

CCAGCTGTTGCCCGCGGTGGCGCGCGAGGTCTTGCGGGCCGGCCCCGGCGTGCCCGAATT 2460 
QLLPAVAREVLRAGPGVPEL

GCGCGCCGCGGGCGAACGGATGCGCGATGCCGTCGCCTATCTCGCTGCCGTGTAGCGGGT 2520 
RAAGERMRDAVAYLAAV 

GCCCTGCCACCGCGCCGCTAAATCCCACCAGCATGGGGTGGTGAGCCCAGCGCTCGTGTG 2580 

FIGURE 2B 
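
Figure 2 prints each row of DNA with its one-letter translation beneath it. The sketch below shows one way to reproduce such a line using the standard genetic code. The `translate` function and its compact codon table are illustrative, not taken from the original, and the reading frame is assumed to start at the first base supplied.

```python
from itertools import product

# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA in frame from its first base; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored. A codon containing
    anything other than A, C, G or T raises KeyError.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First DNA row of Figure 2A.
row = "GAGCTCGACCCCGCCGCCGAAACAGAGGTGGCCCCGCAGACCGAAAGGCCCAAGGTGCTG"
print(translate(row))
```

Rows whose printed translation starts part-way through a codon need the leading one or two bases trimmed (or borrowed from the previous row) before calling `translate`.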
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ACCAAACTCACCGCCCTGGGCCGTCGTCACGCTGTGTTAACCTCTCGTTCAAATGATATT 2640 

CATATTCAATAGTGGCGCTAAGTGTCCGGTTGAATCCCCGTTGAACCCCCAACAGATGGA 2700 

GTCTGTGTCGTGACGTTGCGAGTCGTTCCCGAAAGCCTGGCAGGCGCCAGCGCTGCCATC 2760 

GAAGCAGTGACCGCTCGCCTGGCCGCCGCGCACGCCGCGGCGGCCCCGTTTATCGCGGCG 2820 

GTCATCCCGCCTGGGTCCGACTCGGTTTCGGTGTGCAACGCCGTTGAGTTCAGCGTTCAC 2880 

GGTAGTCAGCATGTGGCAATGGCCGCTCAGGGGGTTGAGGAGCTCGGCCGCTCGGGGGTC 2940 
MWQWPLRGLRSSAARGS

GGGGTGGCCGAATCGGGTGCCAGTTATGCCGCTAGGATGCGCTGGCGGCGGCGTCGTATC 3000 
GWPNRVPVMPLGCAGGGVVS 

TCAGCGGTGGGCTATGACCGAGCCGTGGATAGCCTTCCCTCCCGAGGTGCACTCGGCGAT 3060 
QRWAMTEPWIAFPPEVHSAM 

GCTGAACTACGGTGCGGGCGTTGGGCCGATGTTGATCTCCGCCACGCAGAATGGGGAGCT 3120 
LNYGAGVGPMLISATQNGEL

CAGCGCCCAATACGCAGAAGCGGCATCCGAGGTCGAGGAATTGTTGGGGGTGGTGGCCTC 3180 
SAQYAEAASEVEELLGVVAS 

CGAGGGATGGCAGGGGCAAGCCGCCGAGGCGTTAGTCGCCGCGTACATGCCGTTTCTGGC 3240 
EGWQGQAAEALVAAYMPFLA

GTGGCTGATCCAAGCCAGCGCCGACTGCGTGGAAATGGCCGCCCAGCAACACGCCGTCAT 3300
WLIQASADCVEMAAQQHAVI

CGAGGCCTACACTGCCGCGGTAGAGCTGATGCCTACTCAGGTCGAACTGGCCGCCAACCA 3360 
EAYTAAVELMPTQVELAANQ 

AATCAAGCTCGCGGTGTTGGTAGCGACCAATTTCTTTGGCATCAACACCATTCCCATTGC 3420 
IKLAVLVATNFFGINTIPIA 

GATCAATGAGGCCGAGTACGTGGAGATGTGGGTTCGGGCCGCCACCACGATGGCGACCTA 3480 
INEAEYVEMWVRAATTMATY

TTCAACAGTCTCCAGATCGGCGCTCTCCGCGATGCCGCACACCAGCCCCCCGCCGCTGAT 3540 
STVSRSALSAMPHTSPPPLI

CCTGAAATCCGATGAACTGCTCCCCGACACCGGGGAGGACTCCGATGAAGACGGCCACAA 3600 
LKSDELLPDTGEDSDEDGHN

CCATGGCGGTCACAGTCATGGCGGTCACGCCAGGATGATCGATAACTTCTTTGCCGAAAT 3660 
HGGHSHGGHARMIDNFFAEI

CCTGCGTGGCGTCAGCGCGGGCCGCATTGTTTGGGACCCCGTCAACGGCACCCTCAACGG 3720
LRGVSAGRIVWDPVNGTLNG

ACTCGACTACGACGATTACGTCTACCCCGGTCACGCGATCTGGTGGCTGGCTCGAGGCCT 3780 
LDYDDYVYPGHAIWWLARGL 

CGAGTTTTTTCAGGATGGTGAACAATTTGGCGAACTGTTGTTCACCAATCCGACTGGGGC 3840 

EFFQDGEQFGELLFTNPTGA 

TTTTCAGTTCCTCCTCTACGTCGTTGTGGTGGATTTGCCGACGCACATAGCCCAGATCGC 3900
FQFLLYVVVVDLPTHIAQIA 

FIGURE 2C 
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TACCTGGCTGGGCCAGTACCCGCAGTTGCTGTCGGCTGCCCTCACTGGCGTCATCGCCCA 3960 
TWLGQYPQLLSAALTGVIAH 

CCTGGGAGCAATAACTGGTTTGGCGGGCCTATCCGGCCTGAGCGCCATTCCGTCTGCTGC 4020
LGAITGLAGLSGLSAIPSAA

GATACCCGCCGTTGTACCGGAGCTGACACCCGTCGCGGCCGCGCCGCCTATGTTGGCGGT 4080 
IPAVVPELTPVAAAPPMLAV

CGCCGGGGTGGGCCCTGCAGTCGCCGCGCCGGGCATGCTCCCCGCCTCAGCACCCGCACC 4140 
AGVGPAVAAPGMLPASAPAP 

GGCGGCAGCGGCCGGCGCCACCGCAGCCGGCCCGACGCCGCCGGCGACTGGTTTCGGAGG 4200 
AAAAGATAAGPTPPATGFGG 

GCTTCCCGCCCTACCTGGTCGGCGGTGGCGGCCCAGGAATAGGGTTCGGCTCGGGACAGT 4260 
LPALPGRRWRPRNRVRLGTV

CGGCCCACGCCAAGGCCGCGGCGTCCGATTCCGCTGCAGCCGAGTCGGCGGCCCAGGCCT 4320 
GPRQGRGVRFRCSRVGGPGL 

CGGCGCGTGCGCAGGCGCGTGCTGCACGGCGGGGCCGCTCGGCGGCAAGGCACGTGGCCA 4380 
GACAGACCTAGPLGGKARGH 

TCGTGACGAATTC 4393
RDEF
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